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0\ (54) Title: COMPOSITIONS AND METHODS FOR THE THERAPY AND DUGNOSIS OF OVAIOAN CANCER 

(57) Abstract Compositions and methods for the therapy and diagnosis of cancer, particulaiiy ovarian cancer, are disclosed. Dius- 
trative compositions comprise one or more ovarian tumor polypeptides, immunogenic portions thereof, poljmucleotides that encode 
^ such polypeptides, antigen presenting cell that expresses such polypeptides, and T cells that are specific for cells expressing such 
^ polypq)tides. The disclosed compositions are useful, for example, in the diagnosis, prevention and/or treatment of diseases, paitic- 
^ ularly ovarian cancer. 
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COMPOSmONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF OVARIAN CANCER 

BACKGROUND OF THE INVENTION 

5 Field of the Inventioii 

The present invention relates generally to ovarian cancer flierapy. The 
invention is more specifically related to polypeptides comprising at least a portion of an 
ovarian carcinoma protein, and to polynucleotides encoding such polypeptides, as well 
as antibodies and immune system cells that specifically recognize such polypeptides. 
10 Such polypeptides, polynucleotides, antibodies and cells may be used in vaccines and 
pharmaceutical compositions for treatment of ovarian cancer. 



Description of Related Art 

Ovarian cancer is a significant health problem for women in the United 

15 States and throughout the world. Although advances have been made in detection and 
Iher^y of tiiis cancer, no vaccine or other universally successfiil mediod for prevention 
or treatment is currently available. Management of the disease currently relies on a 
combination of early diagnosis and aggressive treatment, which may include one or 
more of a variety of treatments such as surgery, radiotheraqpy, chemotherapy and 

20 hormone therapy. The course of treatment for a particular cancer is often selected based 
on a variety of prognostic parameters, includinjg an analysis of specific tumor nwrkers. 
However, the use of established markers often leads to a result that is difficult to 
interpret, and high mortality continues to be observed in many cancer patients. 

Immunotherapies have the potential to substantially improve cancer 

25 treatment and survival. Such therapies may involve the generation or enhancement of 
an immune response to an ovarian carcinoma antigen. However, to date, relatively few 
ovarian carcinoma antigens are known and the generation of an immune response 
against such andg^ has not been shown to be therapeutically beneficial. 
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Accordingly, there is a need in die art fitr iniproved methods for 
identifyiog ovarian tumor antigens and for using such antigens in flie Aerapy of ovarian 
cancer. The present invention fulfills these needs and fijrther provides other related 
advantages. 

5 

BRIEF SUMMARY OF THE INVENTION 

Briefly stated, this invention provides compositions and mdhods for the 
ther^y of cancer, such as ovarian cancer. 

In one aspect, the present invoition provides polynucleotide 
1 0 compositions comprising a sequence selected fix>m the group consisting of: 

(a) sequences provided in SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 
27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 
95, 97-100, 103, 107, 111, 114, 117, 120, 121, 125, 128, 132-134, 136, 137, 140, 143- 
146, 148-151, 156, 158, 160-162, 166-168, 171, 174-183, 185, 193-199, 203-206, and 

15 208; 

(b) complements of the sequences provided in SEQ ID N0:1, 2, 5, 9, 
10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 63, 65, 69-72, 75, 78, 
80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 120, 121, 125, 128, 132-134, 
136, 137, 140, 143-146. 148-151, 156, 158, 160-162, 166-168, 171, 174-183, 185, 193- 

20 199, 203-206, and 208; 

(c) sequences conasting of at least 20 contiguous residues of a 
sequence provided m SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 
41-50, 52, 53, 56, 57, 63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 
111, 114, 117. 120, 121, 125, 128, 132-134, 136, 137, 140, 143-146, 148-151, 156, 158, 

25 160-162, 166-168, 171, 174-183, 185, 193-199, 203-206, and 208; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 63, 65, 
69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 120, 121, 125, 
128, 132-134, 136, 137, 140, 143-146, 148-151, 156, 158, 160-162, 166-168, 171, 174- 

30 183, 185, 193-199, 203-206, and 208 under moderately stcin^t conditions; 
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(e) sequences having at least 75% identity to a sequence provided in 
SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 
63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 120, 
121, 125, 128, 132-134, 136, 137, 140, 143-146, 148-151, 156, 158, 160-162, 166-168, 

5 171, 174-183, 185, 193-199, 203-206, 208, and 210-214; 

(f) sequences having at least 90% identity to a sequence provided in 
SEQ ID N0:1, 2, 5. 9, 10, 13, 16. 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 
63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 120, 
121, 125, 128, 132-134, 136, 137, 140, 143-146, 148-151, 156, 158, 160-162, 166-168, 

10 171, 174-183, 185, 193-199, 203-206, 208 and 210-214; and 

(g) degenerate variants of a sequence provided in SEQ ID N0:1, 2, 
5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 63, 65, 69-72, 75, 
78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 120, 121, 125, 128, 132- 
134, 136, 137, 140, 143-146, 148-151, 156, 158, 160-162, 166-168, 171, 174-183, 185, 

15 193-199, 203-206, 208 and 210-214. 

In one preferred embodiment, the polynucleotide compositions of the 
invention are expressed in at least about 20%, more preferably in at least about 30%, 
and most preferably in at least about 50% of ovarian tumors samples tested, at a level 
that is at least about 2-fold, preferably at least about 5-fold, and most preferably at least 

20 about 10-fold higher than that for normal tissues. 

In one aspect, Ihe present invention provides polypeptides comprising an 
hnmunogenic portion of an ovarian carcmoma protein, or a variant thereof that difTers 
in one or more substitutions, deletions, additions and/or insertions such that the ability 
of the variant to react with ovarian carcinoma protein-specific antisera is not 

25 substantially diminished Within certain embodiments, the ovarian carcinoma protein 
comprises a sequence that is encoded by a polynucleotide sequence selected from the 
group consisting of SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41- 
50, 52, 53, 56, 57, 63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 
111, 114, 117, 120, 121, 125, 128, 132-134, 136, 137, 140, 143-146, 148-151, 156, 158, 

30 160-162, 166-168, 171, 174-183, 185, 193-199, 203-205, 208 and 210-214, and 
complements of such polynucleotides. 
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The present invention further provides polynucleotides that encode a 
polypeptide as described above or a portion tiiereof, expression vectors comprising such 
polynucleotides and host cells transformed or transfected with such expression vectors. 

The present invention further provides polypeptide compositions 
5 comprising an amino acid sequence selected from the group consisting of sequences 
recited in SEQ ID NO:200-202, 207, 209 and 21 5. 

In certain pref^ed embodiments, the polypeptides of the preset 
invention are immunogenic, they are capable of eliciting an immune response, 
particularly a humoral and/or cellular immune response, as further described herein. 
10 The present invention further provides fiagments, variants and/or 

derivatives of the disclosed polypeptide sequences, wherein the fragments, variants 
and/or derivatives preferably have a level of immunogenic activity of at least about 
50%, preferably at least about 70% and more preferably at least about 90% of the level 
of immimogenic activity of a the ovarian carcinoma protein comprises an amino acid 
15 sequence encoded by a polynucleotide that comprises a sequence recited in any one of 
SEQIDNO:l-185, 187-199, 203-206, 208 and 210-214. 

Within other aspects, the present invention provides pharmaceutical 
compositions comprising a polypeptide and/or polynucleotide as described above and a 
physiologically acceptable carrier. 
20 Witiun a related aspect of the preset invention^ the pharmaceutical 

compositions, vaccine compositions, are provided for prophylactic or therapeutic 
applications. Such compositions generally comprise an immunogenic polypeptide or 
polynucleotide of the invention and an immunostimulant, such as an adjuvant 

The present invention further provides pharmaceutical compositions that 
25 comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a polypeptide of the present invention, or a fragment thereof; and (b) a physiologically 
acceptable carrier. 

Within further aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
30 described above and (b) a pharmaceutically acceptable carrier or excipient Illustrative 
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antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 
and B cells. 

Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above 
5 and (b) an immunostimulant. 

The present invention further provides, in other aspects, fusion proteins 
that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable carrier and/or an 
10 immunostimulant The fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants theieoi^ as described herein, and may further comprise 
one or more polypeptide segments for facilitating the e?cpression, purification and/or 
immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for 
15 stimulating an immune response in a patient, preferably a T cell response in a human 
patient, comprising administering a pharmaceutical composition described herein. The 
patient may be afflicted with ovarian cancer, in which case the methods provide 
treatment for the disease, or patient considered at risk for such a disease may be treated 
prophylactically. 

20 Within finther aspects, the present invention provides methods for 

inhibitmg the development of a cancer in a patient, comprising administering to a 
patient a pharmaceutical composition as recited above. The patient may be afflicted 
with ovarian cancer, in which case the methods provide treatment for the disease, or 
patient considered at risk for such a disease may be treated prophylactically. 

25 The presmt invention further provides, within other aspects, methods for 

removing tumor cells from a biological sample, comprising contacting a biological 
sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
permit the removal of cells expressing the protein from the sample. 
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Within related aspects, metiiods are provided for inhibiting the 
development of a cancer in a patient, comprising administering to a patient a biological 
sample treated as described above. 

Methods are further provided, within other aspects, for stimulating 
5 and/or expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) an ovarian carcinoma polypeptide as 
described above; (ii) a polynucleotide encoding such a polypeptide; and/or (iii) an 
antigen presenting cell that expresses such a polypeptide; under conditions and for a 
time sufficient to permit the stimulation and/or expansion of T ceUs. Isolated T cell 
10 populations comprising T cells prepared as described above are also provided 

Within further aspects, the present invention provides mefliods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
patient an effective amoimt of a T cell population as described above. 

The present invention fiirther provides methods for inhibiting the 
15 development of a cancer in a patient, comprising the steps of: (a) incubating CD4"^ 
and/or CD8"^ T cells isolated fi*om a patient with one or more of: (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
20 of the proliferated T cells, and thereby inhibiting the development of a cancer in the 
patient Proliferated cells may, but need not, be cloned prior to administration to the 
patient. 

Within further aspects, the present invention provides methods for 
determining the presence or absence of a cancer, preferably an ovarian cancer, in a 

25 patient comprising: (a) contacting a biological sample obtained fiom a patient with a 
binding agent that binds to a polypeptide as recited above; (b) detecting in the sample an 
amount of polypeptide that binds to the binding agent; and (c) comparing the amount of 
polypeptide with a predetermined cut-off value, and therefi-om determining the presence 
or absence of a cancer in the patient Withiu preferred embodiments, the binding agmt 

30 is an antibody, more preferably a monoclonal antibody. 
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The present mvention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient Such methods comprise the steps 
of: (a) contacting a biological sample obtained fix)m a patient at a first point in time 
vsdth a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
5 sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained from the patient at a subsequent point in 
time; and (d) comparing the amount of polypeptide detected in step (c) \vith the amount 
detected in step (b) and there&om monitoring the progression of tihe cancer in tiie 
patient. 

1 0 The present invention further provides, within o&er aspects, methods for 

determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained fix>m a patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a polypeptide of the present invention; (b) 
detecting in the sample a level of a polynucleotide, preferably mRNA, that hybridizes to 

15 the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 
oligonucleotide with a predetermined cxit-oflf value, and therefrom determining the 
presence or absence of a cancer in the patient Within certain embodiments, the amount 
of mRNA is detected via polymerase chain reaction using, for example, at least one 
oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as 

20 recited above, or a complement of such a polynucleotide. Within other embodiments, 
the amount of mRNA is detected using a hybridization technique, employii^ an 
oligonucleotide probe that hybridizes to a polynucleotide tiiat encodes a polypeptide as 
recited above, or a complement of such a polynucleotide. 

In related aspects, methods are provided for monitoring the progression 

25 of a cancer in a patient, comprising the steps of: (a) contacting a biological sample 
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of the present invention; (b) detecting in the sample an amoimt of 
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 
using a biological sample obtained from the patient at a subsequent point in time; and 

30 (d) comparing the amount of polynucleotide detected in step (c) with the amount 
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detected in step (b) and therefrom monitoring the progression of the cancer in Ae 
patient 

Within further aspects, the present invention provides antibodies, such as 
monoclonal antibodies, that bind to a polypeptide as described above, as well as 
5 diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description. All references disclosed herein are 
hereby incorporated by reference in their entirety as if each was incorporated 
10 individually. 

BRIEF DESCRIPTION OF THE SEQUENCE IDENTIFIERS 

SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 
56, 57, 63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 
120, 121, 125, 128, 132-134, 136, 137, 140, 143-146, 148-151, 156, 158, 160-162, 166- 
15 168, 171, 174-183, 185, and 193-199 are described in Tables ffl-VH below. 

SEQ ID NO:200 is the amino acid sequence of a polypeptide encoded by 
thepolynucleotide recited in SEQ ID N0:182; 

SEQ ID NO:201 is the amino acid sequence of a polypeptide encoded by the 
polynucleotide recited in SEQ ID NO:182; 
20 SEQ ID NO:202 is the amino acid sequence of a polypeptide encoded by the 

poljraicleotide recited in SEQ ID NO:182. 

SEQ ID NO:203 is the detemiined extended cDNA sequence for SEQ ID 
NO:197. 

SEQ ID NO:204 is the determined extended cDNA sequence for SEQ ID 

25 NO:198. 

SEQ ID NO:205 is the determmed extended cDNA sequence for SEQ ID 
NO:199. 

SEQ ID NO:206 is the detemimed cDNA sequence for the coding region of 
0568S fused to an N-terminal His tag. 
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SEQ ID NO:207 is the amino acid sequence of ib& polypeptide encoded by ihe 
polynucleotide recited in SEQ ID NO:206. 

SEQ ID NO:208 is the detennined cDNA sequence for the coding region of 
GPR39 as downloaded from the High Throughput Genomics Database, 
5 SEQ ID NO:209 is the amino acid sequence encoded by the cDNA sequence 

recited in SEQ ID NO:208. 

SEQ ID NO:210 is the nucleotide sequence od O1034C an ovaiy specific EST 
clone discovered using electronic subtraction. 

SEQ ID N0:21 1 is the full length nucleotide sequence of 0591S. 
10 SEQ ID NO:212 is the sequence BF345141 which shows sequence homology 

with O1034C/O591S aUowing for the extension of 0591S. 

SEQ ID NO:213 is tibe sequence BE336607 which shows sequence homology 
with O1034C/O591S allowing for the extmsionof 0591S. 

SEQ ID NO:214 is the consensus nucleotide sequence of O1034C/O591S 
1 5 containing 1 897 base pairs. 

SEQ ID NO:215 is the predicted translation of the open reading frame identified 
within SEQ ID NO:214 (nucleotides 260-682). 

DETAILED DESCRIPllON OF IHE INVENTION 

The present invention is directed generally to compositions and their use 
20 in the therapy and diagnosis of cancer, particularly ovarian canc^. As described fiirther 
below, illustrative compositions of the present invention include, but are not restricted 
to, polypeptides, particularly immunogCTic polypeptides, polynucleotides encoding such 
polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and 
immune system cells (e.g., T cells). 

25 The practice of the present invention will employ, imless indicated 

specifically to the. contrary, conventional methods of virology, immunology, 
microbiology, molecular biology and recombinant DNA techniques within the skill of 
the art, many of vMch are described below for die purpose of illustration. Such 
techniques are explained fiiUy in the litemture. See, e.g:, Sambrook, et al. Molecular 

30 Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: 
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A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & U (D. 
Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed, 1984); Nucleic Acid 
Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed, 1986); Perbal, 
5 A Practical Guide to Molecular Cloning (1984). 

All publications, patents and patent applications cited herein, whether 
supra or infra, are hereby incorporated by reference m their entirety. 

As used in this specification and the app^ded claims, the siii^ar forms 
"a," "an" and "die!* include plural references unless the content clearly dictates 
10 otherwise. 

Polypeptide Compositions 

As used herein, the term "polypeptide" " is used in its conventional 
meaning, z.e., as a sequence of amino acids. The polypeptides are not limited to a 
specific length of the prodxict; thxis, peptides, oligopeptides, and proteins are included 

15 within the definition of polypeptide, and such terms may be used interchangeably herein 
unless specifically indicated otherwise. This term also does not refer to or exclude post- 
expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. A polypeptide may be an entire 

20 protein, or a subsequence thereof. Particular polypeptides of interest in the context of 
this invention are amino acid subsequences comprising epitopes, z.e., antigenic 
determinants substantially responsible for the immunogenic properties of a polypeptide 
and being capable of evoking an immune response. 

Particularly illustrative polypeptides of the present invention comprise 

25 those encoded by a polynucleotide sequence set forth in any one of SEQ ID N0:1, 2, 5, 
9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 63, 65, 69-72, 75, 78, 
80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 120, 121, 125, 128, 132-134, 
136, 137, 140, 143-146, 148-151, 156, 158, 160-162, 166-168, 171, 174-183, 185, 193- 
199, 203-206, 208, and 210-214 or a sequence that hybridizes under moderately 

30 stringent conditions, or, alternatively, under highly stringent conditions, to a 



10 



wo 02/39885 PCT/USO 1/45395 



polynucleotide sequence identified above. Certain other illustrative polypeptides of the 
invention comprise amino acid sequences as set forth in any one of SEQ ID NO:200- 
202, 207, 209 and 215. 

The polypeptides of the present invention are sometimes herein referred 
5 to as ovarian tumor proteins or ovarian tumor polypeptides, as an indication that their 
identification has been based at least in part upon their increased levels of expression in 
ovarian tumor samples. Thus, a "ovarian tumor polypeptide" or "ovarian tumor 
protein," refers generally to a polypeptide sequence of the present invention, or a 
polynucleotide sequence encoding such a polypeptide, that is expressed in a substantial 

10 proportion of ovarian tumor samples, for example preferably greater than about 20%, 
more prefembly greater than about 30%, and most preferably greater than about 50% or 
more of ovarian tumor samples tested, at a level tfiat is at least two fold, and preferably 
at least five fold, greater than the level of expression in normal tissues, as determined 
using a representative assay provided herein. An ovarian tumor polypeptide sequence 

15 of the invention, based upon its increased level of expression in tumor cells, has 
particular utility both as a diagnostic marker as well as a th»:apeutic target, as fiirther 
described below. 

In certain preferred embodiments, the polypeptides of the invention are 
immunogenic, /.e., th^ react detectably within an immunoassay (such as an ELISA or 

20 T-cell stunulation assay) with antisera and/or T-cells from a patient with ovarian cancer. 
Screening for unmunogenic activity can be performed using techniques well known to 
ihe skilled artisan. For example, such screens can be performed using methods such as 
those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 

25 immobilized on a solid support and contacted with patient s^ to allow binding of 
antibodies within the sera to the immobilized polypeptide. Unbound sera may then be 
removed and boxmd antibodies detected using, for example, ^^^Mabeled Protein A. 

As would be recognized by the skilled artisan, immtmogenic portions of 
the polypeptides disclosed herein are also encompassed by the present invention. An 

30 'immunogenic portion,^ as used herein, is a firagment of an immunogenic polypeptide 
of the invention that itself is immunologically reactive {Le., specifically binds) wi& the 
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B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 
Immunogenic portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental Immunology^ 3rd ed., 243-247 (Raven Press, 
1993) and references cited therein. Such techniques include screening polypeptides for 
5 the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or 
clones. As used herein, antisera and antibodies are "antigen-specific" if they 
specifically bind to an antigen {ie., they react vnih the protein in an ELISA or other 
immunoassay, and do not react detectably with unrelated protems). Such antisera and 
antibodies may be prepared as described herein, and using well-known techniques. 

10 In one preferred embodiment, an immunogenic portion of a polypeptide 

of the present invention is a portion that reacts witii antisera and/or T-cells at a level ibat 
is not substantially less than the reactivity of the full-length polypeptide (e.g., in an 
ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of 
the immunogCTic portion is at least about 50%, preferably at least about 70% and most 

15 preferably greater than about 90% of the immunogenicity for the fiiU-length 
polypeptide. In some instances, preferred immunogenic portions will be identified that 
have a level of inmaunogenic activity greater than that of the corresponding full-length 
polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 
activity. 

20 In certain other embodiments, illustrative immunogenic portions may 

include peptides in which an N-tefminal leader sequence and/or transmembrane domain 
have been deleted. Other illustrative inununogenic portions will contain a small N- 
and/or C-terminal deletion (eg., 1-30 amino acids, preferably 5-15 amino acids), 
relative to the mature protein. 

25 In another embodiment, a polypeptide composition of the invention may 

also comprise one or more polypeptides that are immunologically reactive with T cells 
and/or antibodies generated against a polypeptide of the invention, particularly a 
polypeptide having an amino acid sequence disclosed herein, or to an immimogenic 
fiagment or variant thereof. 

30 In another embodiment of the invention, polypeptides are provided that 

comprise one or more polypeptides tiiat are enable of eliciting T cells and/or antibodies 
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that aie immunologically leactive with one or more polypeptides described herein, or 
one or more polypeptides encoded by contiguous nucleic acid sequences contained in 
the polynucleotide sequences disclosed herein, or immunogenic fragments or variants 
thereof or to one or more nucleic acid sequences which hybridize to one or more of 
5 these sequences under conditions of moderate to high stringency. 

The present invention, in another aspect, provides polypeptide fragments 
comprising at least about S, 10» 15, 20, 25, 50, or 100 contiguous amino acids, or more, 
including all intennediate lengths, of a polypeptide compositions set forth herein, such 
as those set forth in SEQ ID NO:200-202, 207, 209, and 215 or those encoded by a 
10 polynucleotide sequence set forth in any one of SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 
23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 63, 65, 69-72, 75, 78, 80-82, 84, 86, 89- 
93, 95, 97-100, 103, 107, 111, 114, 117, 120, 121, 125, 128, 132-134, 136, 137, 140, 
143-146, 148-151, 156, 158, 160-162, 166-168, 171, 174-183, 185, 193-199, 203-206, 
208, and 210-214. 

15 In another aspect, the present invention provides variants of the 

polypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%, 75%, 
80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity 
(detemuned as described below), along its lengdi, to a polypeptide sequences set forth 

20 herein. 

In one preferred embodiment, the polypeptide fragments and variants 
provide by the preset invention are immunologically reactive witii an antibody and/or 
T-cell that reacts with a full-length polypeptide specifically set for the herein. 

In another preferred embodiment, the polypeptide fragments and variants 
25 provided by the present invention exhibit a level of immunogenic activity of at least 
about 50%, preferably at least about 70%, and most preferably at least about 90% or 
more of that exhibited by a fiiU-lengtii polypeptide sequence specifically set forth 
herein. 

A polypeptide "variant," as the term is used herein, is a polypeptide that 
30 typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or iosertions. Such variants may be naturally 
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occurring or may be synthetically generated, for example, by modifying one or more of 
the above polypeptide sequences of the invention and evaluating their immunogenic 
activity as described herein and/or using any of a number of techniques well known in 
the art. 

5 For example, certain illustrative variants of the polypeptides of the 

invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 
include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino 
acids) has been removed from flie N- and/or C-terminal of the ntiature protein. 

10 In many instances, a variant will contain consCTvative substitutions. A 

"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chraiistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially imchangeA As described above, modifications may be 

15 made in the structure of the polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that, encodes a variant or derivative polypeptide 
with desimble characteristics, with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 
even an improved, immunogenic variant or portion of a polypeptide of the invention, 

20 one skilled in the art will typically change one or more of flie codons of the encoding 
DNA sequence according to Table 1. 

For example, certain amino acids may be substituted for other amino 
acids in a protein structure without appreciable loss of interactive binding capacity with 
structures such as, for example, antigen-binding regions of antibodies or binding sites 

25 on substrate molecules. Since it is the interactive capacity and nature of a protein that 
defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated that various changes may be made in the peptide sequences of the 

30 disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 
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In making such changes, the hydropathic index of amino acids may be 
5 considered. The importance of the hydropathic amino acid index in conferring 
interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other 
10 molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
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isoleucine (+4.5); valine (+42); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); metbionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutanaine (-3.5); aspartate (-3.5); aspar^ine (-3.5); lysine (-3.9); and arginine (-^.5). 
5 It is known in the art that certain amino acids may be substituted by other 

amino acids having a similar hydropathic index or score and still resiJt in a protein with 
similar biological activity, i,e. still obtain a biological functionally equivalent protein, 
hi making such changes, the substitution of amino acids whose hydropathic indices are 
within +2 is preferred, those within +1 are particularly preferred, and those within ±0.5 

10 are even more particularly preferred It is also understood in the art tiiat the substitution 
of like amino acids can be made efifectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 
greatest local average hydrophilicity of a protein, as govCTied by the hydrophilicity of 
its adjacent amino acids, correlates with a biological property of the protein. 

15 As detailed in U. S. Patent 4,554,101, the following hydrophilicity values 

have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine 
(-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 

20 2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an anuno acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalmt, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within i2 
is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even 

25 more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the Uke. Exemplary substitutions that 
take various of the foregoing characteristics into consideration are well known to those 

30 of skill in the art and include: arginine and lysine; glutamate and aspartate; s^ine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 
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Iq addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or T 0-methyl 
rather than phosphodiesterase linkages in the backbone; and/or the inclusion of 
5 nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- 
metidyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and 
uridine. 

Amino acid substitutions may fiirther be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipatfaic 

10 nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with imcharged polar head groups having similar hydrophilicity values 
include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 

15 represent conservative changes include: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr, (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, tip, 
his. A variant may also, or alternatively, contain nonconservative changes. In a 
preferred embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 

20 alt^tiatively) be modified by, for example, the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropafliic 
nature of the polypeptide. 

As noted above, polypeptides may comprise a signal (or leader) sequence 
at the N-terminal end of the protein, which co-translationally or post-translationally 

25 directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support For 
example, a polypeptide may be conjugated to an immunoglobulin Fc regioa 

When comparing polypeptide sequences, two sequences are said to be 

30 "identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
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sequences are typically perfomied by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
5 reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal aligmnent of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WT), using default parameters. This program embodies several 

10 alignment schemes described in the following references: Dayhoffi M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoflf, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, SuppL 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 

15 vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:\\A1\ Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) MoL Biol Evol ^:406- 
425; Sneath, PJH.A. and Sokal, R-R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 

20 Lipman, DJ. (1983) Proc. Natl Acad, Sci USA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add APL 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol Biol 48:443, by the search for similarity mefliods of Pearson and Lipman (1988) 

25 Proc. Natl Acad Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFTT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WT), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
30 percent sequence identity and sequence similarity are tiie BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl Acids Res. 25:3389-3402 
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and Altschul et al. (1990) X Mol Biol 215:403-410, lespectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herem, to detennine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 
5 Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X ftoxsx its mayimuni 
achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue aligmnents; or the end of either sequence is 

10 reached. The BLAST algorithm paiameteis W, T and X determine &e sensitivity and 
speed of the alignment 

In one preferred approach, the **percentage of sequence identity" is 
detennined by comparing two optimally aligned sequences over a window of 
comparison of at least 20 positions, wherein the portion of the polypeptide sequence in 

15 the comparison window may comprise additions or deletions (f.e., gaps) of 20 percent 
or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by detemiinmg the number of positions at 
^^ch the identical amino acid residue occurs in both sequences to yield the number of 

20 matched positions, dividing the number of matched positions by the total number of 
positions in tiie reference sequence {Le., the window size) and multiplying the results by 
100 to yield the percent£^e of sequence identity. 

Withm other illustrative embodiments, a polypeptide may be a fusion 
polypeptide tiiat comprises multiple polypeptides as described herein, or lhat comprises 

25 at least one polypeptide as described herein and an unrelated sequence, such as a known 
tumor protein, A fusion partner may, for example, assist in providing T helper epitopes 
(an immimological fusion partner), prefembly T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fusion partners are both immunological 

30 and expression enhancmg fusion partners. Other fusion partners may be selected so as 
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to 
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desired intracellular compartments. Still further fusion partners include a£Snity tags, 
which fecilitate purification of the polypeptide. 

Fusion polypeptides may generally be prepared using standard 
techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
5 expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fused polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expression vector. Ihe 3' end of the DNA sequence encoding one 
polypeptide component is ligated, with or without a peptide linker, to the S* end of a 

10 DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retams the biological activity of both component polypeptides. 

A peptide linker sequence may be employed to separate the first and 
second polypeptide components by a distance sufficient to ensure that each polypeptide 

15 folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
secondary structure that could interact with functional q)itopes on the first and second 

20 polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
used in the linker sequence. Amino acid sequences v^ch may be usefully employed as 
linkers include those disclosed in Maratea et al.. Gene ^0:39-46, 1985; Murphy et al., 

25 Proc. Natl. Acad ScL USA 55:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 
Patent No. 4,75 1 , 1 80. The linker sequence may generally be fix)m. 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 
separate the functional domains and prevent steric int^erence. 

30 The ligated DNA sequences are operably linked to suitable 

transcriptional or translational regulatory elements. The regulatory elements 
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responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 
transcription termination signals are only present 3' to the DNA sequence encoding the 
second polypeptide. 

5 The fusion polypeptide can comprise a polypeptide as described herein 

together with an unrelated inmiunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins {see, for example, Stoute et ai. New Engl J, Med., 
535:86-91, 1997). 

10 In one preferred embodiment, the inGmunological fusion partner is 

derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fi-agment Ral2 compositions and methods for their use in enhancing the expression 
' and/or inmiunogenicity of heterologous polynucleotide/polypeptide sequences is 
described in U.S. Patent Application 60/158,585, the disclosure of which is 

15 incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 
and avirulent strains of M tuberculosis. The nucleotide sequence and anodno acid 
sequence of MTB32A have been described (for example, U.S. Patent Application 

20 60/158,585; see also, Skeiky et al. Infection and Immun. (1999) 67:3998-4007, 
incorporated h^ein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the 
purification process. Moreover, Ral2 may enhance tihe inununogenicity of heterologous 
immunogenic polypeptides with which it is fused. One preferred Ral2 fusion 

25 polypeptide comprises a 14 BCD C-terminal fi:agment corresponding to amino acid 
residues 192 to 323 of MTB32A. Other preferred Ral2 polynucleotides generally 
comprise at least about 15 consecutive nucleotides, at least about 30 nucleotides, at least 
about 60 nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, or at 
least about 300 nucleotides that encode a portion of a Ral2 polypeptide. Ral2 

30 polynucleotides may comprise a native sequence {Le., an endogenous sequmce that 
encodes a Ral2 polypeptide or a portion thereoQ or may comprise a variant of such a 
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sequence. Ral2 polynucleotide variants may contain one or more substitutions, 
additions, deletions and/or insertions such that the biological activity of the encoded 
fusion polypeptide is not substantially diminished, relative to a fusion polypeptide 
comprising a native Ral2 polypeptide. Variants preferably exhibit at least about 70% 
5 identity, more preferably at least about 80% identity and most preferably at least about 
90% identity to a polynucleotide sequence that encodes a native Ral2 polypeptide or a 
portion thereof. 

Within other preferred embodiments, an immunological fusion partner is 
derived fiom protein D, a sur&ce protein of tibie gram-negative bacterium Haemophilus 

10 influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein (e,g., flxe first N-tenninal 100-110 amino 
acids), and a protein D derivative may be lipidated. Within certain preferred 
embodiments, the first 109 residues of a Lipoprotem D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 

15 increase the expression level in E. coli (thus functioning as an expression enhancer). 
The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fusion partners include the non-structural protein from influenzae virus, NSl 
(hemaglutinin). Typically, the N-tenninal 81 amino acids are used, although different 
fiiagments that include T-helper epitopes may be used. 

20 In another embodiment, the immunological fusion partner is the protein 

known as LYTA, or a portion thereof (preferably a C-tCTninal portion). LYTA is 
derived firom Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine 
amidase known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 

25 backbone. The C-terminal domain of the LYTA protein is responsible for the afiSnity to 
the choline or to some choline analogues such as DEAE. This property has been 
exploited for tiie development of E. coli C-LYTA expressing plasmids useful for 
expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
firagment at the amino terminus has been described (see Biotechnology 70:795-798, 

30 1992). Within a preferred embodiment, a repeat portion of LYTA naay be incorporated 



22 



wo 02/39S85 



PCT/USOl/45395 



into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at 
residue 178. A particularly preferred repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fusion polypeptides, and 
the polynucleotides encoding them, wherem the fusion partner comprises a targeting 
5 signal enable of directing a polypeptide to the endosomal/lysosomal compiartment, as 
described in U.S. Patent No. 5,633^34. An immunogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhanced in vivo stunulation of CD4* T-cells specific 
for the polypeptide. 

10 Polypeptides of the invention are prepared using any of a variety of well 

known synthetic and/or recombinant techniques, tiie latter of which are further 
described below. Polypeptides, portions and other variants generally less tiian about 
150 amino acids can be generated by synthetic means, using techniques well known to 
those of ordinary skill in the art In one illustrative example, such polypeptides are 

15 synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain. See Merrifield, Am, Chem. Soc, 55:2149-2146, 1963. 
Equipment for automated synthesis of polypeptides is conunercially available fix)m 
suppliers such as Perkin Elmer/Applied BioSystems Division OPoster City, CA), and 

20 may be operated according to the manufitcturer's instructions. 

Li general, polypeptide compositions (includmg fiision polypeptides) of 
the invention are isolated. An "isolated" polypeptide is one tiiat is removed firom its 
original environment For example, a naturally-occurring protem or polypeptide is 
isolated if it is separated fix>m some or all of tiie coexisting materials in the natural 

25 system. Preferably, such polypeptides are also purified, e.g., are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
pure. 

Polynucleotide Compositions 

The present invention, in. other aspects, provides polynucleotide 
30 compositions. The terms "DNA" and "polynucleotide" are used essentially 
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interchangeably herein to refer to a DNA molecule that has been isolated free of total 
genomic DNA of a particular species. "Isolated,'' as used herein, means lhat a 
polynucleotide is substantiaDy away from other coding sequences, and that the DNA 
molecule does not contain large portions of unrelated coding DNA, such as large 
5 chromosomal fragments or other ftmctional genes or polypeptide coding regions. Of 
course, this refers to the DNA molecule as originally isolated, and does not exclude 
genes or coding regions lat^ added to the segment by Ihe hand of man. 

As will be understood by those skilled in the art, the polynucleotide 
compositions of tins invention can include genomic sequences, extra--genomic and 

10 plasmid-encoded sequences and smaller engiaeered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may 
be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 
invention may be single-stranded (coding or antisense) or double-stranded, and may be 

15 DNA (genomic, cDNA or synthetic) or KNA molecules. RNA molecules may include 
HnRNA molecules, which contain introns and correspond to a DNA molecule in a one- 
to-one manner, and mRNA molecules, which do not contain introns. Additional coding 
or non-coding sequences may, but need not, be present within a polynucleotide of the 
present invention, and a polynucleotide may, but need not, be linked to other molecules 

20 and/or support materials. 

^ Polynucleotides msy coqiprise a native sequence (i.e., an endogenous 
sequence that encodes a polypeptide/protein of the invention or a portion thereof) or 
may comprise a sequence that encodes a variant or derivative, preferably and 
inmiunogenic variant or derivative, of such a sequence. 

25 Therefore, accordiixg to another aspect of the present invention, 

polynucleotide compositions are provided that comprise some or all of a polynucleotide 
sequence set forth in any one of SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 
. 35, 38, 41-50, 52, 53, 56, 57, 63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 
103, 107, 111, 114, 117, 120, 121, 125, 128, 132-134, 136, 137, 140, 143-146, 148-151, 

30 156, 158, 160-162, 166-168, 171, 174-183, 185, 193-199, 203-206, 208 and 210-214, 
compl^ents of a polynucleotide sequence set forth as described above, and degenerate 
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variants of a polynucleotide sequence set forth as described above. In certain preferred 
embodiments, the polynucleotide sequences set forth herein encode immunogenic 
polypeptides, as described above. 

In other related embodiments, the present invention provides 
5 polynucleotide variants having substantial identity to the sequences disclosed herein in 
SEQ ID N0:1, 2, 5, 9, 10, 13, 16, 19, 23, 27, 28, 32, 33, 35, 38, 41-50, 52, 53, 56, 57, 
63, 65, 69-72, 75, 78, 80-82, 84, 86, 89-93, 95, 97-100, 103, 107, 111, 114, 117, 120, 
121, 125, 128, 132-134, 136, 137, 140, 143-146, 148-151, 156, 158, 160-162, 166-168, 
171, 174-183, 185, 193-199, 203-206, 208 and 210-214, for example those comprising 

10 at least 70% sequence identity, preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 
97%, 98%, or 99% or higlher, sequence identity compared to a polynucleotide sequence 
of this invention iising the methods described herein, (eg., BLAST analysis using 
standard parameters, as described below). One skilled in this art will recognize that 
these values can be appropriately adjusted to determine corresponding identity of 

1 5 proteins encoded by two nucleotide sequences by taking into accoxmt codon degeneracy, 
amino acid similarity, reading frame positioning and the like. 

Typically, polynucleotide variants will contain one or more substitutions, 
additions, deletions and/or insertions, preferably such that the immunogenicity of the 
polypeptide encoded by the variant polynucleotide is not substantially diminished 

20 relative to a polypeptide encoded by a polynucleotide sequence specifically set forth 
herein). The term "Variants" should also be understood to encompasses homologous 
genes of xenogenic origia 

In additional embodiments, the present invention provides 
polynucleotide firagments comprising various lengths of contiguous stretches of 

25 sequence identical to or complementary to one or more of the sequences disclosed 
herein- For example, polynucleotides are provided by this invention that comprise at 
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 
contiguous nucleotides of one or more of the sequences disclosed herein as well as all 
intermediate lengths there between. It will be readily understood that "intermediate 

30 lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, 
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etc,\ 150, 151, 152, 153, etc,\ including all integers through 200-500; 500-1,000, and the 
like. 

In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions to 
5 a polynucleotide sequence provided herein, or a fragment thereof^ or a complementary 
sequence thereof Hybridization techniques are well known in the art of molecular 
biology. For purposes of illustration, suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide.of this invention with other polynucleotides 
include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); 

10 hybridizing at 50*'C-60^C, 5 X SSC, overnight; followed by washing twice at 65^*0 for 
20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. One skilled in 
the art will understand that tiie stringency of hybridization can be readily manipulated, 
such as by altering the salt content of the hybridization solution and/or the temperature 
at which the hybridization is performed. For example, in another embodiment, suitable 

15 highly stringent hybridization conditions include those described above, with the 
exception that the temperature of hybridization is increased, e.^., to 60-65°C or 65- 
70^C. 

In certain preferred embodiments, the polynucleotides described above, 
e.g:, polynucleotide variants, fragments and hybridi^g sequences, encode polypeptides 

20 tiiat are immunologically cross-reactive with a polypeptide sequence specifically set 
forth hereiiL In other preferred embodiments, such polynucleotides encode 
polypeptides that have a level of immunogenic activity of at least about 50%, preferably 
at least about 70%, and more preferably at least about 90% of tiiat for a polypqptide 
sequence specifically set forth hereiit 

25 The polynucleotides of the present invention, or fi:agments thereof, 

regardless of tiie length of the coding sequence itself, may be combined with other DNA 
sequences, such as promoters, polyadenylation signals, additional restriction enzyme 
sites, multiple cloning sites, other coding segments, and the like, such that their overall 
length may vary considerably. It is therefore contemplated that a nucleic acid firagment 

30 of almost any length may be employed, witii the total length preferably being limited by 
the ease of preparation and use in Ihe intended recombinant DNA protocol. For 
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example, illustrative polynucleotide segments with total lengths of about 10,000, about 
5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 
base pairs in length, and the like, (including all intennediate lengths) are contemplated 
to be useful m many implementations of this inventioiL 
5 When comparing polynucleotide sequences, two sequences are said to be 

"identical'' if the sequence of nucleotides in the two sequences is the same when aligned 
for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local re^ons of sequence similarity. A "comparison 

10 window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 

15 the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoflf, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 

20 Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hem J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Emymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and MuUer W. (1988) CABIOS ¥:11-17; Robinson, 
E.D. (1971) Comb. Theor 77:105; Santou, N. Nes, M. (1987) Mol Biol Evol 4:406- 

25 425; Sneath, P.IIA. and Sokal, FLR. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy^ Freeman Press, San Francisco, CA; Wilbur, WJ. and 
Lipman, D.J. (1983) Proc. Natl Acad, Set USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algoritimi of Smith and Waterman (1981) Add APL 

30 Math 2:482, by tiie identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol BioL 48:443, by the search for similarity metiiods of Pearson and Lipman (1988) 
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Proc. Natl Acad. Set USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFTT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

5 One preferred example of algorithms that are suitable for determining 

percent sequence identity and sequence similarity are the BLAST and BLAST 2,0 
algorithms, which are described in Altschul et al. (1977) NucL Acids Res. 25:3389-3402 
and Altschul et ai. (1990) /. MoL Biol 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example wi& the parameters described herein, to determine percent 

10 sequence identity for the polynucleotides of the invention. Software for performing 
BLAST analyses is publicly available through the National Center for Biotechnology 
Information. In one illustrative example, cumulative scores can be calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 

15 the word hits in each direction are halted when: the cumulative aligmnent score falls off 
by the quantity X from its maximum achieved value; the cimiulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment. The BLASTN program (for 

20 nucleotide sequences) xises as defaults a wordlength (W) of 1 1, and e:q)ectation (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoffand HenikojBF(1989) Proc. Natl 
Acad Sci USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and 
a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 

25 comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (f.e., gaps) of 20 percent or less, usually 5 
to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequmces. The 

30 percentage is calculated by determining the number of positions at v^ch the identical 
nucleic acid bases occius in both sequences to yield the number of matched positions. 
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dividing the number of matched positions by the total number of positions in the 
reference sequence (/.e., the window size) and multiplying the results by 100 to yield the 
percentage of sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a result 
5 of the degeneracy of the genetic code, there are many nucleotide sequences that encode 
a polypeptide as described herein. Some of these polynucleotides bear miTiiTnal 
homology to the nucleotide sequence of any native gene. Nonetiieless, polynucleotides 
that vary due to differences m codon usage are specificaUy contemplated by tiie present 
invention. Further, alleles of the genes comprising the polynucleotide sequences 

10 provided herein are within the scope of the present invention. Alleles are endogenous 
genes that are altered as a result of one or more mutations, such as deletions, additions 
and/or substitutions of nucleotides. The resulting mRN A and protein may, but need not, 
have an altered structure or function. Alleles may be identified using standard 
techniques (such as hybridization, amplification and/or database sequence comparison). 

15 Therefore, in another embodiment of the invention, a mutagenesis 

approach, such as site-specific mutagenesis, is employed for the preparation of 
immunogenic variants and/or derivatives of the polypeptides described herein. By this 
approach, specific modifications in a polypeptide sequence can be made through 
mutagenesis of the underlying polynucleotides that encode them. These techniques 

20 provides a straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 
nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the 
use of specific oligonucleotide sequences \^ch encode the DNA sequence of the 

25 desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on 
both sides of the deletion junction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
change the properties of the polynucleotide itself and/or alter the properties, activity, 

30 composition, stability, or primaiy sequence of tiie encoded polypeptide. 
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In certain embodiments of the present invention, the inventors 
contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
more properties of the encoded polypeptide, such as the immunogenicity of a 
polypeptide vaccine. The techniques of site-specific mutagenesis are well-known in the 
5 art, and are widely used to create variants of both polypeptides and polynucleotides. For 
example, site-specific mxitagenesis is often used to alter a specific portion of a DNA 
molecule. In such embodiments, a primer comprising typically about 14 to about 25 
nucleotides or so in length is employed, with about S to about 10 residues on both sides 
of the junction of the sequence being altered. 

10 As will be appreciated by those of skill in the art, site-specific 

mutagenesis techniques have often employed a phage vector that exists in both a single 
stranded and double stranded form. Typical vectors usefiil in site-directed mutagenesis 
include vectors such as &e M13 phage. These phage are readily commercially-available 
and their use is generally well-known to those skilled in the art. Double-stranded 

15 plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene of interest from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is 
performed by first obtaining a single-stranded vector or melting apart of two strands of a 
double-stranded vector that includes within its sequence a DNA sequence that encodes 

20 the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 
vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I 
Klenow firagment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand racodes the original non-mutated 

25 sequence and the second strand bears the desired mutation. This heteroduplex vector is 
then used to transform appropriate cells, such as E. coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected peptide-encoding 
DNA segments using site-directed mutagenesis provides a means of producing 

30 potentially usefiil species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and &e DNA sequences encoding them may be 
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obtained For example, recombinant vectors encoding the desired peptide sequence 
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et al, 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
5 Maniatis etal.^l 982, each incorporated herein by reference, for that purpose. 

As used herein, the term "oligonucleotide directed mutagenesis 
procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
. 10 signal, such as amplification. As used herein, the term ''oligonucleotide directed 
mutageuesis procedure" is intended to refer to a process that involves the 
template-dependent extension of a primer molecule. The tenn template dependent 
process ref^s to nucleic acid synthesis of an RNA or a DNA molecule >?dxerein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 

15 rules of complementary base pairing (see, for example, Watson, 1987). Typically, 
vector mediated methodologies involve the introduction of the nucleic acid firagment 
into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of 
the amplified nucleic acid firagment Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

20 In another q>proach for the production of polypeptide variants of ftxe 

present invention, recursive sequence recombination, as described in U.S. Pat^t No. 
5,837,458, may be employed. In this approach, iterative cycles of recombination and 
screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. 

25 In other embodiments of the present invention, the polynucleotide 

sequences provided herein can be advantageously used as probes or primers for nucleic 
acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
a sequence region of at least about 15 nucleotide long contiguous sequence that has the 
same sequence as, or is compl^entaiy to, a 15 nucleotide long contiguous sequence 

30 disclosed herein will find particular utility. Longer contiguous identical or 
complementary sequences, e.g., tiiose of about 20, 30, 40, 50, 100, 200, 500, 1000 
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(including all intennediate lengths) and even iq) to full length sequences will also be of 
use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to a 
sequence of interest will enable them to be of use in detecting the presence of 
5 complementary sequences in a given sample. However, other uses are also envisioned, 
such as the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
10 or so (including intermediate lengths as well), identical or complementary to a 
polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e,g,. Southern and Northern blotting. This would allow a gene 
product, or fragment thereof, to be analyzed, both in diverse cell types and also in 
various bacterial cells. The total size of fragment, as well as the size of the 
15 complementary stretch(es), will ultimately depend on the intended use or application of 
the particular nucleic acid segment Smaller fragments wiQ generally find use in 
hybridization embodiments, wherein the length of the contiguous complementary region 
may be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 
20 complementary sequences one wishes to detect 

Tlie use of a hybridization probe of about 15-25 nucleotides in lengfk 
allows the formation of a dxiplex molecule that is both stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than IS bases m 
length are generally preferred, though, in order to increase stability and selectivity of the 
25 hybrid, and thereby improve the quality and degree of specific hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules having gene- 
complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

Hybridization probes may be selected from any portion of any of the 
30 sequences disclosed hereiiL All that is required is to review the sequences set fordi 
herein, or to any continuous portion of &e sequences, fix>m about 15-25 nucleotides in 
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length up to and including the full length sequence, that one wishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
factors. For example, one may wish to employ primers from towards the termini of the 
total sequence. 

5 Small polynucleotide segments or fragments may be readily prepared by, 

for example, directly synthesizing the fragment by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic add reproduction technology, such as the PGR™ 
technology of U. S. Patent 4,683,202 (incorporated herein by reference), by introducing 

10 selected sequences into recombinant vectors for recombinant production, and by other 
recombinant DNA techniques graerally known to those of skill in the art of molecular 
biology. 

The nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of the entire gene or 

15 gene fragments of interest. Depending on the application envisioned, one will typically 
desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, 
one will typically desire to employ relatively stringent conditions to form the hybrids, 
one will select relatively low salt and/or high temperature conditions, such as 

20 provided by a salt concentration of from about 0.02 M to about O.IS M salt at 
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate 
litde, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 

Of course, for some applications, for example, where one desires to 

25 prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
salt, at temperatures ranging from about 20^*0 to about 55^*0. Cross-hybridizing species 

30 can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
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more stringent by the addition of increasing amounts of foimamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 
5 According to another embodiment of the presmt invention, 

polynucleotide compositions comprising antisense oligonucleotides are provided 
Antisense oligonucleotides have been d^onstrated to be effective and tai^eted 
inhibitors of protein synthesis, and, consequently, provide a therapeutic ^proach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 

10 the disease. The efBcacy of antisense oligonucleotides for inhibiting protein synthesis 
is well established. For example, the synthesis of polygalactauronase and the muscarine 
type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective mKNA sequences (U. S. Patent 5,739,119 and U. S. Patent 5,759,829). 
Fvffther, examples of antisense inhibition have been demonstrated witii the nuclear 

15 protein cyclin, the miJtiple drug resistance gene (MDGl), ICAM-1, E-selectin, STK-1, 
striatal GABAa receptor and human EOF (Jaskulski e/ a/.. Science. 1988 Jun 
10;240(4858):1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
32; Peris et al.. Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S. Patent 
5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patent 5,610.288). 

20 Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S. 
Patent 5,591,317 and U. S. Patent 5.783,683). 

Therefore, in certain embodiments, the present invention provides 
oligonucleotide sequences that comprise all, or a portion o^ any sequence that is 

25 capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 
or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof. . In a third embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorotbioated modified backbone. In a fourth embodiment, the 

30 oligonucleotide sequences comprise peptide nucleic adds or derivatives thereof. In 
each case, preferred compo^tions comprise a sequence region that is complementary. 
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and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 
Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, Tm, 
5 binding energy, and relative stability. Antisense compositions may be selected based 
upon their relative inability to form dimers, hairpins, or other secondary structures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Higihly 
preferred target regions of the mRNA, are those which are at or near the AUG 
translation initiation codon, and those sequences which are substantially complementary 

10 to 5' regions of the mSNA. These secondary structure analyses and target site selection 
considerations can be performed, for example, using v.4 of the OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et aL, Nucleic Acids 
Res. 1997,25(17):3389-402). 

The use of an antisense delivery method employing a short peptide 

15 vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 
hydrophobic domain derived from the fiision sequence of HTV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et aLy 
Nucleic Acids Res. 1997 Jul 15;25(14):2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 

20 into cultured mammalian cells in less than 1 hour with relatively high e^deacy (90%). 
Further, the interaction with MPG strongly increases both the stability of the 
oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described herein are used in tiie design and preparation of ribozyme 

25 molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Ribozymes are RNA-protein complexes that cleave 
nucleic acids in a site-specLfic fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788.92; Forster and Symons, CeU. 1987 Apr 24;49(2):211.20). For 

30 example, a large number of ribozymes accelerate phosphoester transfix reactions with a 
high degree of specificity, often cleaving only one of several phosphoesters in an 
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oligonucleotide substrate (Cech et al. Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992 May 14;357(6374):173-6). This specificity has been attributed to the requirement 
that the substrate bind via specific base-pairing interactions to the internal guide 
5 sequence ("IGS") of the ribozyme prior to chemical reaction. 

Six basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 
^izymatic nucleic acids act by first binding to a target RNA. Such binding occurs 

10 through the target binding portion of a enzymatic nucleic acid vdiich is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 

15 direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released fi-om that RNA to search for another target and can 
repeatedly bind and cleave new targets. 

The en2ymatic nature of a ribozyme is advantageous over many 
.technologies, such as antisense technology (where a nucleic acid molecule simply binds 

20 to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessaiy to affect a therapeutic treatmaat is lower than that of an antisense 
oligonucleotide. This advantage reflects the ability of &e ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 
target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 

25 of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 
(Woolf eM/., Proc Natl Acad Sci U S A. 1992 Aug 15;89(16):7305-9). Thus, the 

30 specificity of action of a ribozyme is greater than that of an antisense oligonucleotide 
binding the same RNA site. 
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The enzymatic nucleic acid molecule may be fonned in a hammerhead, 
hairpin, a hepatitis 5 virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi etal. Nucleic Acids Res. 1992 Sep ll;20(17):4559-65. Examples of 
5 haiipin motifs are described by Hampel et al (Eur. Pat AppL PubL No. EP 0360257), 
Hampel and Tritz, Biochemistry 1989 Jun 13;28(12):4929-33; Hampel etal. Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of the 
hepatitis 5 virus motif is described by Perrotta and Been, Bioch^nistry. 1992 Dec 
1;3 1(47): 11 843-52; an example of the RNaseP motif is described by Guerrier-Takada 

10 etal.. Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RNA ribozyme motif is 
described by CoUms (Saville and Collms, Cell. 1990 May 18;61(4):685-96; Saville and 
ColUns, Proc Nati Acad Sci USA. 1991 Oct l;88(19):8826-30; Collms and OUve, 
Biochemistry. 1993 Mar 23;32(ll):2795-9); and an example of the Group I intron is 
described in (U. S. Patent 4,987,071). All that is important in an enzymatic nucleic acid 

15 molecule of this invention is that it has a specific substrate binding site which is 
complementary to one or more of the target gene RNA regions, and that it have 
nucleotide sequences vriMn or surroimding that substrate binding site which impart an 
RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be 
limited to specific moti& mentioned herein. 

20 Ribozymes may be designed as desoibed in Int Pat Appl. Publ. No. 

WO 93/23569 and Int. Pat Appl. Publ. No. WO 94/02595, each specificaUy 
incorporated herein by reference) and syntiiesized to be tested in vitro and in vivo, as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in tiie art will recognize that equivalent RNA targets in 

25 other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 
ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e,g,, Int Pat Appl. Publ. No. WO 
92/07065; Int PaL Appl. Publ. No. WO 93/15187; Int Pat Appl. PubL No. WO 

30 91/03162; Eur. Pat Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int PaL 
AppL Publ. No. WO 94/13688, which describe various chemical modifications that can 
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be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efScacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan etal (Int. Pat Appl. Publ. No. WO 94/02595) describes the 
5 general methods for delivery of enzymatic RNA molecules. Ribozymes may be 
administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclode?ctrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 

1 0 directly delivered ex vrvo to cells or tissues with or without the aforementioned vehicles. 
Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, infusion pump or stent. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 

15 systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 
of ribozyme delivery and administration are provided in Int. Pat Appl. Publ. No. WO 
94/02595 and Int Pat Appl. Publ. No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating higih concentrations of a ribozyme(s) 

20 within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
vector. Transcription of the ribozyme sequences are driven fiom a promoter for 
eukaiyotic RNA polymerase I (pol I), RNA polymerase n (pol II), or RNA polymerase 
in (pol m). Transcripts bom pol n or pol IH promoters will be expressed at high levels 
in all cells; the levels of a given pol n promoter in a given cell type will depend on the 

25 nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 
Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells Ribozymes 
expressed firom such promoters have been shown to fimction in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 

30 mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 
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vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
retroviral, semliki forest virus, sindbis virus vectors). 

la another embodiment of the invention, peptide nucleic acids (PNAs) 
compositions are provided, PNA is a DNA mimic in which the nucleobases are 
5 attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 
Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number methods that 
traditionally have used SNA or DNA. Often PNA sequences perform better in 
techniques than tbe corresponding RNA or DNA sequences and have utilities that are 
not inherent to RNA or DNA. A review of PNA including methods of making, 

10 characteristics o^ and methods of using, is provided by Corey (Trends Biotechnol 1997 
Jui^lS(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 

1 5 such PNA compositions have been administered. 

' PNAs have 2-aminoethyl-glycine linkages replacing the normal 
phosphodiester backbone of DNA (Nielsen et al. Science 1991 Dec 6;254(5037):1497- 
500; Hanvey et al. Science. 1992 Nov 27;258(5087):1481-5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(l):S-23). This chemistry has three important 

20 consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
are neutral molecules; secondly, PNAs are achiral, \^ch avoids the need to develop a 
stereoselective synthesis; and thirdly, PNA syndesis uses standard Boc or Fmoc 
protocols for solid-phase peptide synthesis, although o&er methods, including a 
modified Menifield method, have been used. 

25 PNA monomers or ready-made oligomers are commercially available 

firom PerSeptive Biosystems (Framingham, MA), PNA syntheses by either Boc or 
Fmoc protocols are straightforward using manual or automated protocols (Norton et al, 
Bioorg Med Chem. 1995 Apr,3(4):437-45). The manual protocol lends itself to the 
production of chemically modified PNAs or the simultaneous synthesis of families of 

30 closely related PNAs. 
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As with peptide synthesis, the success of a particular PNA synthesis will 
depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product In expectation of this 
5 diflBculty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefSciently. This ^ould be followed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
providing yields and purity of product similar to those observed during the synthesis of 
peptides. 

10 Modifications of PNAs for a given application may be accomplished by 

coupling amino acids during solid-phase synthesis or by attaching compoimds that 
contain a carboxylic acid group to the exposed N-terminal amine. Alternatively^ PNAs 
can be modified after syndiesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 

15 for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (for example, Norton et al, Bioorg Med Chem. 1995 
Apr;3(4):437-45; Petersen et al, J Pept Sci. 1995 May-Jun; 1(3): 175-83; Orum et al, 
Biotechniques. 1995 Sep;19(3):472-80; Footer et a/., Biodiemistry. 1996 Aug 

20 20;35(33):10673-9; Griffith et al. Nucleic Acids Res. 1995 Aug ll;23(15):3003-8; 
Pardridge et al, Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; Bofife et al, 
Proc Natl Acad Sci U S A. 1995 Mar 14;92(6):1901-5; Gambacorti-Passerini et al. 
Blood. 1996 Aug 15;88(4):1411-7; Annitage et al, Proc Natl Acad Sci U S A. 1997 
Nov ll;94(23):12320-5; Seeger et al., Biotechniques. 1997 Sep;23(3):512-7). U.S. 

25 Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treataient of conditions susceptible to 
therapeutics. 

Methods of characterizing the antisense binding properties of PNAs are 
discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et aL 
30 (Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring tibie 
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relative binding kinetics and stoichiometry. Similar types of measurements were made 
by Jensen et al using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
5 mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor bindii^ genome 
cleavage, biosensors, in situ hybridization, and the like. 

Polynucleo tide Tdent jfication. Characterizatibn and Expression 

Polynucleotides compositions of fbe present invention may be identified, 

10 prepared and/or manipulated using any of a variety of well established techniques ^see 
generally, Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
screening a microarray of cDNAs for tumor-associated expression (/.e., expression that 

15 is at least two fold greater in a tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed, for example, 
using the microarray technology of Af^etrix, Inc. (Santa Clara, CA) according to the 
manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl 
Acad. Set USA 93:10614-10619, 1996 and Heller et al., Proc. Natl Acad, Set USA 

20 P^:2150-2155, 1997). Alternatively, polynucleotides may be amplified j&om cDNA 
prepared fix>m cells expressing the proteins described herein, such as tumor cells. 

Many template dependent processes are available to amplify a target 
sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PCR™) which is described in detail in U.S. Patent 

25 Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PCR™, two primer sequences are prepared vMch 
are complementary to regions on opposite complementary strands of the target 
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g., Tag polymerase). If the target sequence is present 

30 in a sample, the primers will bind to the target and the polymerase will cause the 
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primers to be extended along the target sequence by adding on nucleotides. By raising 
and lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to form reaction products, excess primers will bind to tiie 
target and to the reaction product and the process is repeated. Preferably reverse 
5 transcription and PCR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art 

Any of a number of other template dependent processes, many of which 
are variations of flie PCR ™ amplification technique, are readily known and available in 

10 the art niustradvely, some such methods include the ligase chain reaction (referred to 
as LCR), described, for example, in Eur. Pat AppL Publ. No. 320,308 and U.S. Patent 
No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat AppL Publ. No. 
PCT/US87/00880; Strand Displacement Amplification (SDA) and Repair Chain 
Reaction (RCR). Still other amplification methods are described in Great Britain Pat 

15 AppL No. 2 202 328, and in PCT Intl. Pat AppL Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 
(TAS) (PCT Intl. Pat AppL Publ. No. WO 88/10315), including nucleic acid sequence 
based amplification (NASBA) and 3SR. Eur. Pat AppL PubL No. 329,822 describes a 
nucleic acid amplification process involving cyclically synthesizing single-stranded 

20 RNA ("ssRNA"), ssDNA, and'double-stranded DNA (dsDNA). PCT Intl. Pat. AppL 
PubL No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 
C'ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification mefliods such as "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 

25 1 989) are also well-known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a fiill length gene from a suitable library {e.g., 2i tumor cDNA library) 
using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 

30 Preferably, a library is size-selected to include larger molecules. Random primed 
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libraries may also be prefened for identifying 5* and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

For hybridization techniques, a partial sequence may be labeled (e.g., by 
nick-translation or end-labeling with ^^P) using well known techniques, A bacterial or 
5 bacteriophage library is then generally screened by hybridi2dng jfilters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 

10 be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer firom the partial sequence and a primer firom the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 
complete sequence may then be determined using standard techniques, which may 
involve generating a series of deletion clones. The resulting overlapping sequences can 

15 then assembled into a single contiguous sequence. A fiill length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described above, 
can be useful for obtaining a full length coding sequence £rom a partial cDNA sequence. 
One sudi amplification techmque is inverse PCR {see Triglia et al., NucL Acids Res. 

20 75:8186, 1988), ^ch uses restriction enzymes to generate a fragment in the known 
region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PCR with divergent primers derived from the known region. 
Within an alternative approach, sequences adjacent to a partial sequmce may be 
retrieved by amplification with a primer to a linker sequence and a primer specific to a 

25 known region. The amplified sequences are typically subjected to a second round of 
amplification with the same linker primer and a second primer specific to the known 
region. A variation on this procedure, which employs two primers that initiate 
extension in opposite directions from the known sequence, is described in WO 
96/38591. Another such techmque is known as "rapid amplification of cDNA ends" or 

30 RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5* 
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and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et 
3l., PCR Methods Applia 7:111-19, 1991) and walldng PCR (Parker etal.,JVwc/.i4d£fe. 
Res, 7P:3055-60, 1991). Other methods employing amplification may also be employed 
to obtain a full length cDNA sequence. 
5 In certain instances, it is possible to obtain a full length cDNA sequence 

by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
tiiat available from GenBank. Searches for overlapping ESTs may generally be 
performed using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 

10 may also be obtained by analysis of genomic fragments. 

In other embodiments of the invention, polynucleotide sequences or 
fragments thereof which encode polypeptides of the invention, or fusion proteins or 
functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 

15 the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
be used to clone and express a given polypeptide. 

As will be understood by those of skill in the art, it may be advantageous 
in some instances to produce polypeptide-encoding nucleotide sequences possessing 

20 non-naturally occurring codons. For example, codons preferred by a particular 
prokacyotic or eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant KNA transcript having desirable prop^es, such as a half- 
life which is longer than that of a transcript generated from the naturally occurring 
sequence. 

25 Moreover, the polynucleotide sequences of the present invention can be 

engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product For 
example, DNA shufQing by random fragmentation and PCR reassembly of gene 

30 fragments and synthetic oligonucleotides may be used to en^eer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
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sites» alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
5 . encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 
engineered to contain a cleavage site located betwem the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 

1 0 and purified away firom the heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Carutfaers, M. H. et al. 
(1980) Nucl Acids Res, Symp, Ser. 215-223, Horn, T. et al. (1980) NucL Acids Res. 
Symp, Ser. 225-232). Alternatively, the protein itself may be produced using chemical 

15 methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 
achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Ehner, Palo 
Alto,CA). 

20 A newly synthesized peptide may be substantially purified by preparative 

high performance liquid chromatography (e.g.^ Creighton, T. (1983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
confirmed by amino add analysis or sequencing (e,g.^ the Edman degradation 

25 procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences fi-om other proteins, or any part thereof, to produce a variant polypeptide. 

In order to express a desired polypeptide, the nucleotide sequences 
encoding the polypeptide, or functional equivalCTts, may be inserted into appropriate 

30 expression vector, z.a., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
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knov/n to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
5 are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
aL (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. 
N.Y. 

A variety of expression vector/host systems may be utilized to contain 

10 and express polynucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 
insect cell systems infected with virus expression vectors (e.g., baculovims); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 

15 CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" present in an 
expression vector are those non-translated regions of the vector-enhancers, promoters, 
5' and 3' untranslated regions-which interact with host cellular proteins to carry out 

20 transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 
For example, when cloning in bacterial systems, inducible promote such as the hybrid 
lacZ promoter of the PBLUESCRPT phagemid (Stratagene, La Jolla, Calif.) or 

25 PSPORTl plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammahan viruses 
are generally preferred. If it is necessary to generate a cell line that contains multiple 
copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be 
advantageously used witii an appropriate selectable marker. 

30 In bacterial syst^ns, any of a number of expression vectors may be 

selected depending iq>on the use intmded for the expressed polypqptide. For example. 
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when large quantities are needed, for example for the induction of antibodies, vectors 
which direct high level expression of fusion proteins that are readily purified may be 
xised. Such vectors include, but are not limited to, the multifunctional E. coli cloning 
and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence 
5 encoding the polypeptide of interest may be ligated into the vector in firame with 
sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. 
M. Schuster (1989) J. Biol Chem, 26^:5503-5509); and the like. pGEX Vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 

10 proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 
systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 

15 will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha &ctor, alcohol oxidase, and PGH may 
be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
Enzymol 153:516-544. 

20 In cases where plant expression vectors are used, the expression of 

sequences encoding polypeptides may be driven by any of a number of promoters. For 
example, viral promoters such as the 35S and 19S promoters of CaMV may be used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. 
(1987) EMBO J. 5:307-31 1. Alternatively, plant promoters such as flie small subunit of 

25 RUBISCO or heat shock promoters may be used (Coruzzd, G. et al. (1984) EA4B0 J. 
5:1671-1680; Broglie, R. et al. (1984) Science 22^:838-843; and Winter, J. et al. (1991) 
Results Probl. Cell EHffer, 77:85-105). These constructs can be introduced into plant 
cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 
are desaibed in a number of generally available reviews (see, for example, Hobbs, S. or 

30 Muny, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
HiU, New York, N.Y.; pp. 191-196). 
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An insect system may also be used to express a polypeptide of interest 
For example, in one such system, Autographa califomica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
5 non-essential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 

10 (Engelhard, K K. et al, (1994) Proc. Natl Acad. Scl 91 :3224.3227). 

In mammalian host cells, a number of vu^-based expression systems are 
generally available. For example, in cases where an adenovmis is used as an expression 
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 

15 sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 
to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, L and Shenk, T. (1984) Proc. Natl Acad. Sci 57:3655-3659). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 

20 Specific initiation signals may also be used to achieve more efiScient 

translation of sequences encoding a polypeptide of interest Such signals include the 
ATG initiation codon and adjacent sequences. In cas^ vAere sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into die 
appropriate expression vector, no additional transcriptional or translational control 

25 signals may be needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 
reading frame to ensure translation of the entire insert Exogenous translational 
elements and initiation codons may be of variotis origins, both natural and synthetic. 

30 The efGciency of expression may be enhanced by the inclusion of enhancers which are 
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appropriate for the pardcidar cell system which is used, such as those described in the 
literature (Scharf, D. et al. (1994) Residts Probl Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate 
the expression of the inserted sequences or to process the expressed protein in the 
5 desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylatioiL glycosylation, phoq)horylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro*^ form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 
CHO, COS, HeLa, MDCK, HEK293, and WD8, which have specific cellular machinery 

10 and characteristic mechanisms for such post-translational activities, may be chosen to 
ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is generally preferred. For example, cell lines which stably express a 
polynucleotide of interest may be transformed using expression vectors which may 

15 contain viral origins of replication and/or endogenous expression elements and a 
selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for 1-2 days in an enriched media before 
they are switched to selective media. The purpose of Ihe selectable marker is to confer 
resistance to selection, and its presence allows growth and recovery of cells which 

20 successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thymidine 
kinase (Wigler, M. et al. (1977) Cell ii:223-32) and adenine phosphoribosyltransferase 

25 (Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tksup.- or 
aprt.sup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 
be used as the basis for selection; for example, dhfi: which confers resistance to 
methotrexate (Wigler, M. et al. (1980) Proc. Natl Acad Set 77:3567-70); npt, which 
conf^ resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et 

30 al (1981) J. Mol BioL iJ0:l-14); and als or pat, which confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Muny, stpra). 
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Additional selectable genes have been described, for example, trpB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 
place of histidine (Hartman, S. C. and IL C. Mulligan (1988) Proc, Natl Acad Set 
55:8047-51). The use of visible markers has gained popularity with such markers as 
5 anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate 
luciferin, being widely used not only to identify transfonnants, but also to quantify the 
amount of transient or stable protein expression attributable to a specific vector system 
(Rhodes, C A. et al. (1995) Methods Mol Biol 55:121-131). 

Although tiie presence/absence of marker gene expression suggests that 

10 the gene of interest is also preset, its presence and expression may need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 
absence of marker gene function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence xmder tiie control of a single promoter, 

15 Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 
. polynucleotide sequence may be id^tified by a variety of procedures known to those of 
skill in the art These procedures include, but are not lunited to, DNA-DNA or DNA- 

20 KNA hybridizations and protein bioassay or immunoassay techniques which include, 
for example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 

25 specific for the product are known in the art Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 
preferred for some applications, but a competitive binding assay may also be employed. 

30 These and other assays are described, among other places, in Hampton, R. et al. (1990; 
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Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E.etal.(1983;7.£xp. Afed 755:1211-1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be lised in various nucleic acid and anuno acid assays. Means 
5 for producing labeled hybridization or PGR probes for detecting sequences related to 
polynucleotides include oligolabeling, nick translation, end-labeling or PGR 
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the art, are commercially available, and may be used to synthesize RNA 

10 probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
conmiercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 
as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

15 Host cells transformed with a polynucleotide sequence of interest may be 

cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides of the 

20 invention may be designed to contain signal sequences which direct secretion of the 
encoded polypeptide through a prokaiyotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such piuification facilitating domains include, but are 

25 not limited to, metal chelating peptides such as histidine-tiyptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, Wash.), The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 

30 Calif.) between the purification domain and the encoded polypeptide may be used to 
&cilitate purification. One such expression vector provides for expression of a fiision 
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protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a tbioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion afSnity chromatography) as 
described in Porath, J. et al. (1992, Prot Exp. Purif 5:263-281) while the enterokinase 
5 cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein. A discussion of vectors which contain fusion protems is provided m Kroll, D. J. 
et al. (1993; DNA CellBioL i2:44M53), 

In addition to recombinant production methods, polypeptides of the 
invention, and fragments thereof, may be produced by direct peptide synthesis using 

10 solid-phase techniques (Meirifield J. (1963) 1 Am, Chem. Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be achieved, for example, using Applied Biosystems 431A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 
synthesized separately and combined using chemical methods to produce the full length 

15 molecule. 

Antibodv Compositions, Fragments Thereof and Other Binding Agents 

According to another aspect, the present invention further provides 
binding agents, such as antibodies and antigen-binding fiiagments thereof that exhibit 
immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant 

20 or derivative tiiereof. An antibody, or antigen-bmding fragment thereof is said to 
"specifically bind,'* ''immimogically bind," and/or is "immunologically reactive" to a 
polypeptide of the invention if it reacts at a detectable level (within, for example, an 
EIISA assay) with the polypeptide, and does not react detectably with unrelated 
polypeptides under similar conditions. 

25 Immunological binding, as used in this context, generally refers to the 

non-covalent interactions of the type which occur between an immunoglobulin 
molecule and an antigen for which the immunoglobulin is specific. The strength, or 
afSnity of immunological binding interactions can be expressed in terms of the 
dissociation constant (K<i) of the interaction, herein a smaller Kd represents a greater 

30 affinity. Immunological binding properties of selected polypeptides can be quantified 
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using methods well known in the art One such me&od entails measuring the rates of 
antigen-binding site/antigen complex formation and dissociation, herein tiiose rates 
depend on the concentrations of the complex partners, the afiSnity of the interaction, and 
on geometric parameters that equally influence the rate in both directions. Thus, both 
5 the "on rate constant" (Kon) and the "off rate constant" (Ko^ can be determined by 
calculation of the concentrations and the actual rates of association and dissociation. 
The ratio of Koff /Kon enables cancellation of all parameters not related to afSnity, and is 
thus equal to the dissociation constant Kd. See, generally, Davies et al. (1990) Annual 
Rev. Biochem. 59:439-473. 

10 An "antigen-binding site," or "binding portion" of an antibody refers to 

the part of the inmmnoglobulin molecule that participates in antigen binding. The 
antigen binding site is formed by amino acid residues of the N-terminal variable ("V") 
regions of the heavy ("H") and light ("L") chains. Three highly divergent stretches 
within the V regions of the heavy and light chains are referred to as 'Tiypervariable 

15 regions" which are interposed between more conserved flanking stretches known as 
"jframework regions," or "FRs". Thus the term "FR" refers to amino acid sequences 
which are naturally found between and adjacent to hypervariable regions in 
immunoglobulins. In an antibody molecule, the three hypervariable regions of a light 
chain and the three hypervariable regions of a heavy chain are disposed relative to each 

20 other in three dimensional space to form an antigen-binding sur&ce. The antigen- 
binding surface is complementary to the three-dimensional surface of a bound antigen, 
and the three hypervariable regions of each of the heavy and light chains are referred to 
as "complementarity-determining regions," or "CDRs." 

Binding agents may be ftnrther capable of differentiating between patients 

25 with and without a cancer, such as ovarian cancer, using the representative assays 
provided herein. For example, antibodies or other binding agents that bind to a tumor 
protein will preferably generate a signal indicating the presence of a cancer in at least 
about 20% of patients with the disease, more prefCTably at least about 30% of patients. 
Alternatively, or in addition, the antibody will generate a negative signal indicating the 

30 absence of the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies fliis requirement, biological samples (e.g.. 
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blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a 
cancer (as determined using standard clinical tests) may be assayed as described herein 
for the presence of polypeptides that bind to the binding agent Preferably, a statistically 
significant number of samples with and without the disease will be assayed. Each 
5 binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art wiU recognize that binding agents may be used in combination to improve 
sensitivity. 

Any agent that satisfies the above requirements may be a binding agent 
For example, a binding agent may be a ribosome, with or without a peptide component, 

10 an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of 
a variety of techniques known to those of ordinary skill in the ait See, e.g„ Harlow and 
Lane, Antibodies: A Laboratory Manual^ Cold Spring Harbor Laboratory, 1988. In 
general, antibodies can be produced by cell culture techniques, including the generation 

15 of monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is 
initially injected into any of a wide variety of mammals {e.g., mice, rats, rabbits, sheep 
or goats). In this step, the polypeptides of this invention may s^ve as the inummogen 

20 without modification. Alternatively, particularly for relatively short polypeptides, a 
superior immime response may be elicited if the polypeptide is joined to a carrier 
protein, such as bovine serum albumin or keyhole limpet hemocyanin. The iimmmogen 
is injected into the animal host, preferably according to a predetermined schedule 
incorporating one or more booster immunizations, and the animals are bled pCTodically. 

25 Polyclonal antibodies specific for the polypeptide may then be purified from such 
antisera by, for example, afl5nity chromatography using the polypeptide coupled to a 
suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur, J. 

30 Immunol (JrSll-SlP, 1976, and improvements thereto. Briefly, these mefiiods involve 
the preparation of immortal cell lines capable of producing antibodies having the 
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desired specificity (/.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, &om spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
5 animal. A variety of fusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that siq>ports the growdi of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufGcient time, usually about 1 to 2 weeks, 

10 colonies of hybrids are observed. Single colonies are selected and their culture 
. supematants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supematants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 

15 yield, such as injection of tiie hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 

20 in, for example, an afBnity chromatography step. 

A number of thempeutically useful molecules are known in the art which 
comprise antigen-binding sites that are capable of exhibiting immunological binding 
properties of an antibody molecule. The proteolytic enzyme papain preferentially 
cleaves IgG molecules to yield several fragments, two of which (the "F(ab)" fragments) 

25 each comprise a covalent heterodimer that includes an intact antigen-binding site. The 
enzyme pepsin is able to cleave IgG molecules to provide several fragments, including 
the "F(ab*)2 " fragment which comprises both antigen-binding sites. An "Fv" fii^gment 
can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions 
IgG or IgA immuno^obulin molecule. Fv fi:agments are, however, more commonly 

30 derived using recombinant techniques known in the art The Fv fragment includes a 
non-covalent Vh-Vl heterodimer including an antigen-binding site which retains much 
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of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:2706-2710; and EhrUch et al. (1980) Biochem 19:4091-4096. 

A single chain Fv ("sFv") polypeptide is a covalently linked Vh::Vl 
5 heterodimer which is expressed from a gene fusion including Vh- and VL-encoding 
genes Imked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat Acad, Sci. 
USA 85(16):5879-5883. A number of methods have been described to discern chemical 
structures for converting tiie naturally aggregated— but chemically separated— light and 
heavy polypeptide chains from an antibody V region into an sFv molecule which will 

10 fold into a three dimensional structure substantially similar to the structure of an 
antigen-binding site. See, U.S. Pat Nos. 5,091,513 and 5,132,405, to Huston et al.; 
and U.S. Pat No. 4,946,778, to Ladna: et al. 

Each of the above-described molecules includes a heavy chain and a 
light chain CDR set, respectively interposed between a heavy chain and a light chain FR 

15 set which provide support to the CDRS and define the spatial relationship of the CDRs 
relative to each other. As used herein, the term "CDR set" refers to the three 
hypervariable regions of a heavy or light chain V region. Proceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "CDRl," "CDR2," and 
"CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 

20 comprising the CDR set fix)m each of a heavy and a light chain V region. A polypeptide 
comprismg a smgle CDR, (eg., a CDRl, CDR2 or CDR3) is referred to herein as a 
"molecular recognition unit" Ciystallographic analysis of a numb^ of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs form extensive 
contact with bound antigen, wherein the most extensive aniiigen contact is with the 

25 heavy chain CDRS. Thus, the molecular recognition units are primarily responsible for 
the specificity of an antigen-binding site. 

As used herein, the term "FR set" refers to the four flanking amino acid 
sequences vAdch fiame the CDRs of a CDR set of a heavy or light chain V re^on. 
Some FR residues may contact bound antigen; however, FRs are primarily responsible 

30 for folding the V region into the antigen-binding site, particularly the FR residues 
directly adjacent to the CDRS. Within FRs, certain amino residues and certain structural 
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features are very highly conserved. In this regard, all V region sequences contain an 
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a 
binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- 
binding surface. It is generally recognized that there are conserved structural regions of 
5 FRs which influence the folded shape of the CDR loops into certain "canonical" 
structures-regardless of the precise CDR amino acid sequence. Further, certain FR 
residues are known to participate in non-covalent interdomain contacts which stabilize 
the interaction of the antibody heavy and light chains. 

A number of "humanized" antibody molecules comprising an antigen- 
ic binding site derived from a non-human immunoglobulin have been described, including 
chimeric antibodies having rodent V regions and their associated CDRs fused to human 
constant domains (Winter et al. (1991) Nature 349:293-299; Lobuglio et al. (1989) 
Proc. Nat Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol 138:4534- 
4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a 
15 human siq>porting FR prior to fusion with an appropriate human antibody constant 
domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs 
si^ported by recombinantly veneered rodent FRs (European Patent Publication No. 
519,596, published Dec. 23, 1992). These 'liumanized" molecules are designed to 
20 miTn'TniTft unwanted immunological response toward rodent antihuman antibody 
molecules ^diich limits the duration and effectiveness of therapeutic applications of 
tiiose moieties in human recipients. 

As used herein, the terms "veneered FRs" and "recombinantly veneered 
FRs" refer to the selective replacement of FR residues fit)m, e.g. , a rodent heavy or light 
25 chain V region, with human FR residues in order to provide a xenogeneic molecule 
comprising an antigen-binding site which retains substantially all of the native FR 
polypeptide folding structure. Veneering techniques are based on the understanding that 
the ligand binding characteristics of an antigen-binding site are determined primarily by 
flie structure and relative disposition of the heavy and light chain CDR sets within the 
30 antigen-binding surface. Davies et al. (1990) Ann. Rev. BiochenL 59:439-473. Thus, 
antigen binding sfpeciflcity can be preserved in a hxmianized antibody only \^erein the 
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CDR structures, their interaction with each other, and their interaction with the rest of 
flie V region domains are carefiilly maintained. By using veneering techniques, exterior 
solvent-accessible) FR residues which are readily encountered by the immune 
system are selectively replaced with human residues to provide a hybrid molecule that 
S comprises either a weakly immunogenic, or substantially non-immunogenic veneered 
surface. 

The process of veneering makes use of the available sequence data for 
human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of 
Immunological Literest, 4th ed, (U.S. Dept of Health and Human Services, U.S. 

10 Government Printing OfGce, 1987), iq)dates to the Kabat database, and other accessible 
U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be deduced from the known three-dimensional structure for 
human and murine antibody fragments. There are two general steps in veneering a 
murine antigen-binding site. Initially, the FRs of the variable domains of an antibody 

15 molecule of interest are compared with corresponding FR sequences of human variable 
domains obtained fiom the above-identified sources. The most homologous human V 
regions are then compared residue by residue to corresponding murine amino acids. The 
residues in the murine FR v^ch differ from the human counterpart are replaced by the 
residues present in the human moiety using recombinant techniques well known ui the 

20 art. Residue switching is only carried out with moieties which are at least partially 
e3q>osed (solvent accessible), and care is exercised in the rq)lacement of amino acid 
residues which may have a significant effect on the tertiary structure of V region 
domains, such as proline, glycine and charged amino acids. 

In this manner, the resultant "veneered" murine antigen-binding sites are 

25 thus designed to retain the murine CDR residues, the residues substantially adjacent to 
the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the 
residues believed to participate in non-covalent (e.g., electrostatic and hydrophobic) 
contacts between heavy and light chain domains, and the residues from conserved 
structural re^ons of the FRs vMoh are believed to influence the "canonical" tertiary 

30 structures of the CDR loops. These design criteria are then used to prepare recombinant 
nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
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murine antigen-binding site into human-appearing FRs that can be used to transfect 
mammalian cells for the expression of recombinant human antibodies which exhibit the 
antigen specificity of the murine antibody molecule. 

In another embodiment of the invention, monoclonal antibodies of the 
5 present iavention may be coupled to one or more therapeutic agents. Suitable agents in 
this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives 
tiiereof. Preferred radionucUdes include ^^I, ^"l, ^^e, ^^^Re, ^^^At, and 
^^^Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 

10 toxins include ricin, abrin, diptiieria toxin, cholera toxin, gelonin, Pseudomonas 
exotoxin. Shigella toxin, and pokeweed antiviral protein. 

A therapeutic agent may be coupled (e.g,y covalentiy bonded) to a 
suitable monoclonal antibody either directiy or indirectiy (e.g., via a linker group). A 
direct reaction between an agent and an antibody is possible when each possesses a 

15 substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydiyl group, on one may be capable of reacting with a carbonyl- 
containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group (e.g. , a halide) on the other. 

Alternatively, it may be desirable to couple a tiierapeutic agent and an 

20 antibody via a linker group. A linker group can function as a spacer to distance an 
antibody fix>m an agent in order to avoid interference with binding cqiabilities. A linker 
group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibo(fy, and thus increase the coupling efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 

25 otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of bifunctional 
or polyfunctional reagents, both homo- and hetero-functional (such as those described in 
the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Coupling may be effected, for example, through amino groups, carboxyl groups, 

30 sulfhydiyl groiq)s or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 
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Where a therapeutic agent is more potent when free from the antibody 
portion of the inmiunoconjugates of the present invention, it may be desirable to use a 
liiiker group which is cleavable during or upon internalization into a cell. A number of 
different cleavable linker groups have been described. The mechanisms for the 
5 intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide bond {e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a 
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter etal.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
serum complement-mediated hydrolysis (eg., U.S. Patent No. 4,671,958, to Rodwell 

10 et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.). 

It may be desirable to couple more ihrn one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent 

15 may be prepared in a variety of ways. For example, more than one agent may be 
coupled directly to an antibody molecule, or linkers that provide multiple sites for 
attachment can be used Alternatively, a caffier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 
bonding either directly or via a link^ groiq). Suitable carriers include proteins such as 

20 albumins {e,g,, U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 
such as aminodextian (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may 
also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
radionuclide agents include radiohalogenated small molecules and chelating 

25 compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
formed from chelating compounds that include those containing nitrogen and sulfijr 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 

30 compounds and their synthesis. 
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T Cell CompositioDs 

The present invention, in another aspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells 
may generally be prepared in vitro or ex vivo, using standard procedures. For example, 
5 T cells may be isolated firom bone marrow, peripheral blood, or a firaction of bone 
marrow or peripheral blood of a patient, using a commercially available cell separation 
system, suph as the Isolex™ System, available fix)m Nexell Therapeutics, Inc. (Irvine, 
CA; see also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or 

1 0 unrelated humans, non-human mammals, cell lines or cultures. 

T cells may be stimulated with a polypeptide, polynucleotide encoding a 
polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide. 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T ceUs that are specific for the polypeptide of interest Prefembly, a tumor 

15 polypeptide or polynucleotide of the invention is present within a delivery vehicle, such 
as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the present 
invention if the T cells specifically proliferate, secrete cytokines or kill target cells 
coated with the polypeptide or expressmg a gene encoding the polypeptide. T cell 

20 specificity may be evaluated using any of a variety of standard techniques. For 
example, within a chromium release assay or proliferation assay, a stimulation index of 
more than two fold increase in lysis and/or proliferation, compared to negative controls, 
indicates T cell specificity. Such assays may be performed, for example, as described in 
Chen et al., Cancer Res. 5^:1065-1070, 1994. Alternatively, detection of the 

25 proliferation of T cells may be accomplished by a variety of known techniques. For 
example, T cell proliferation can be detected by measuring an increased rate of DNA 
synthesis by pulse-labeling cultures of T cells with tritiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
tumor polypeptide (100 ng/ml - 100 fig/ml, preferably 200 ng/ml - 25 [ig/ml) for 3 - 7 

30 days will typically result in at least a two fold increase in proliferation of the T cells. 
Contact as described above for 2-3 hours should result in activation of the T cells, as 
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measuied using standard cytokine assays in which a two fold increase in the level of 
cytokine release (e.g., TNF or IFN-y) is indicative of T cell activation {see Coligan et 
al.. Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T 
cells that have been activated in response to a tumor polypeptide, polynucleotide or 
5 polypeptide-expressing APC may be CD4^ and/or CDS"^. Tumor polypeptide-specific T 
cells may be expanded using standard techniques. Withm preferred embodiments, the T 
cells are derived horn a patient, a related donor or an unrelated donor, and are 
administered to the patient following stimulation and expansion. 

For ther^utic purposes, 004"^ or CD8"*" T cells that proliferate in 

10 response to a tumor polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 
without the addition of T cell growth factors, such as interleukin-2, and/or stimulator 

15 cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 
cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 

Pharmaceutical Compositions 

20 In additional embodiments, Uie present mvention concerns formulation 

of one or more of the polynucleotide, polypqjtide, T-cell and/or antibody compositions 
disclosed herein in pharmaceudcally-acceptable carriers for administration to a cell or 
an animal, either alone, or in combination with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein 

25 may be administered in combination with other agents as well, such as, e.g., other 
proteins or polypeptides or various pharmaceutically-active agents. In fact, there is 
virtually no limit to other components that may also be included, given that the 
additional agents do not cause a significant adverse effect upon contact with the target 
cells or host tissues. The compositions may thus be delivered along with various other 

30 agents as required in the particular instance. Such compositions may be purified firom 
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host cells or other biological sources, or alternatively may be chemically synthesized as 
described herein- Likewise, such compositions may further comprise substituted or 
derivatized RNA or DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 
5 compositions are provided comprising one or more of the polynucleotide, polypeptide, 
antibody, and/or T-cell compositions described herein in combination with a 
physiologically acceptable carri^. In certain preferred embodiments, the 
pharmaceutical compositions of the invention comprise immunogenic polynucleotide 
and/or polypeptide conq>ositions of the invention for use in prophylactic and themputic 

10 vaccine applications. Vaccine preparation is generally described in, for example, M.F. 
Powell and M.J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," 
Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 
polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimulants. 

15 It will be apparent that any of the pharmaceutical compositions described 

herein can contain pharmaceutically acceptable salts of the polynucleotides and 
polypeptides of the inventioa Such salts can be prepared, for example, from 
pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of 
primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e,g„ 

20 sodium, potassium, litiiium, ammonium, calcium and magnesium salts). 

In another embodiment, illustrative immimogenic compositions, e.g., 
vaccine compositions, of the present invention comprise DNA encoding one or more of 
the polypeptides as desoibed above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of delivery 

25 systems known to those of ordinary skill in the art. Indeed, numerous gene delivery 
techniques are well known in the art, such as those described by Rolland, Crit Rev. 
Therqp. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 
Appropriate poljomcleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 

30 promoter and terminating signal). Alternatively, bacterial delivery systems may mvolve 
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the administration of a bacterium (such as Bacillus-Calmette-Guerriii) that expresses an 
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding 
immimogenic polypeptides described herein are introduced into suitable mammalian 
5 host cells for expression using any of a number of known viral-based systems. In one 
illustrative embodiment, retroviruses provide a convenient and effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 
present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art The recombinant virus can then be isolated and delivered 

10 to a subject A number of illustrative retroviral systems have been described (e.g., U.S. 
Pat No. 5,219,740; MiUer and Rosman (1989) BioTechniques 7:980-990; MiUer, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Bums 
et al. (1993) Proc, Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet Develop. 3:102-109. 

15 In addition, a nimiber of illustrative adenovirus-based systems have also 

been described. Unlike retroviruses \\iiich integrate into the host genome, adenoviruses 
persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) L 
Virol. 67:591 1-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 

20 al. (1994) J. Virol. 68:933-940; Bair et al. (J994) Gene Ther^y 1:51-58; Berkner, K. L. 
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Ther^y 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 
developed for polynucleotide deliveiy. AAV vectors can be readily constructed using 

25 techniques well known in the art See, e,g,, U.S. Pat Nos. 5,173,414 and 5,139,941; 
hitemational Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. 
(1988) Molec. CeU. Biol. 8:3988-3996; Vmcent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 
539; Muzyczka, N. (1992) Current Topics in Aficrobiol. and Immunol. 158:97-129; 

30 Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 
Tbstapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 



64 



wo 02/39885 



PCT/USOl/45395 



Additional viral vectors useful for delivering the polynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 
from the pox famil y of viruses, such as vaccinia virus and avian poxvirus. By way of 
example, vaccinia virus recombinants expressing the novel molecules can be 
5 constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 
then used to transfect cells ^ch are simultaneously infected with vaccinia 
Homologous recombination serves to insert the vaccinia promote plus the gene 

10 encoding the polypeptide of interest into the viral genome. The resulting TKLsup.(-) 
recombinant can be selected by culturing the cells in the presence of S- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

A vaccinia-based infection/transfection system can be conveniently used 
to provide for inducible, transient expression or coexpression of one or more 

15 polypeptides described herein in host cells of an organism. In this particular system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the polynucleotide or polynucleotides of interest, driven by a T7 

20 promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high level, 
transient, cytoplasmic production of large quantities of KNA and its translation 
products. See, e.g., Ehroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 

25 6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding sequences of interest. Recombinant avipox 
viruses, expressing inmnmogens fi-om mammalian pathogens, are known to confer 
protective immunity \^en administered to non-avian species. The use of an Avipox 

30 vector is particularly desirable in human and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
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therefore are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 
above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 
5 Any of a number of alphavirus vectors can also be used for delivery of 

polynucleotide compositions of the present invention, such as those vectors described in 
U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 
on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

10 Moreover, molecular conjugate vectors, such as the adenovirus chimeric 

vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 
under the invention. 

Additional illustrative information on these and other known viral-based 

15 delivery systems can be foimd, for example, in Fisher-Hoch et al., Proc. Natl Acad Set 
USA 55:317-321, 1989; Flexner et ai,,Ann KY, Acad Sci 569M-m, 1989; Rexner 
et al., Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; 
WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniques 5:616-627, 1988; Rosenfeld et al.. Science 

20 252:431-434, 1991; KoUs et al„ Proc. Natl. Acad Sd. USA Pi:215-219, 1994; 
Kass-Eisler et al., Proc Natl. Acad Sci USA P0:1 1498-1 1502, 1993; Guzman et al., 
Circulation 55:2838-2848, 1993; and Guzman et al., Cir. Res. 73:1202-1207, 1993. 

In certain embodimrats, a polynucleotide may be integrated into the 
genome of a target cell. This integration may be in the specific location and orientation 

25 via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet fiirther embodiments, the 
polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
DNA. Such polynucleotide segments or "episomes" encode sequences sufiBcient to 
permit maintenance and replication independent of or in synchronization with the host 

30 cell c^cle. The mann^ in ^^ch the expression construct is delivered to a cell and 
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where in the cell the polynucleotide remains is dependent on the type of expression 
construct employed. 

In another embodiment of the invention, a polynucleotide is 
adniimstered/delivered as "naked" DNA, for example as described in Ulmer et al., 
5 Science 25P: 1745- 1749, 1993 and reviewed by Cohen, Science 25P: 1691-1692, 1993. 
The uptake of naked DNA may be increased by coatinig tiie DNA onto biodegradable 
beads, which are efiSciently transported into the cells. 

In still another embodiment, a composition of the present invention can 
be delivered via a particle bombardment approach, many of which have been described. 
10 In one illustrative example, gas-driven particle acceleration can be achieved with 
devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powdeqect Vaccines Inc. (Madison, WI), some examples of vMch are described m 
U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 
799. This approach offers a needle-free delivery approach wherein a dry powder 
15 formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest. 

In a related embodimmt, other devices and methods that may be useful 
for gas-driven needle-less injection of compositions of the present invention include 
20 those provided by Bioject, Inc. (Portland, OR), some examples of which are described 
m U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 
and 5,993,412. 

According to another embodiment, the pharmaceutical compositions 
described herein will comprise one or more immunostimulants in addition to the 

25 immunogenic polynucleotide, polypeptide, antibody, T-cell and/or APC compositions 
of tills inventiorL An immxmostimulant refers to essentially any substance that enhances 
or potentiates an immune response (antibody and/or cell-mediated) to an exogenous 
antigen. One preferred type of immunostimulant comprises an adjuvant Many 
adjuvants contain a substance designed to protect the antigen from r^id catabolism, 

30 such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such 
as lipid A, Bortadella pertussis or Afycobacteritm tuberculosis derived proteins. 
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Certain adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, 
PA); aliuninutn salts such as aluminum hydroxide gel (alum) or aluminum phosphate; 
5 salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated 
sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl Upid A and quil A. Cytokines, such as 
GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 
adjuvants. 

10 Witiiin certain embodiments of the invention, the adjuvant composition 

is preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl-type cytokines (e.g., IFN-y, TNFo, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the 

15 induction of hiunoral immune responses. Following application of a vaccine as 
provided herein, a patient will support an immune response that includes Thl- and Th2- 
type responses. Within a preferred embodiment, in which a response is predominantly 
Thl-type, &e level of Thl-type cytokines will increase to a greater extent than the level 
of Th2-type cytokines. The levels of these cytokines may be readily assessed using 

20 standard ass^s. For a review of the families of c^okines, see Mosmann and Cof&nan, 
Am Rev. Immunol 7:145-173, 1989. 

Certain preferred adjuvants for eliciting a predominantly Thl-type 
response include, for example, a combination of monophosphoryl lipid A, preferably 3- 
de-O-acylated monophosphoryl lipid A, together with an alimiinum salt MPL® 

25 adjuvants are available from Corixa Corporation (Seattle, WA; see, for example, US 
Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing 
oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a 
predominantly Thl response. Such oligonucleotides are well known and are described, 
for example, .m WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 

30 5,856,462. Immunostimulatory DNA sequences are also described, for example, by 
Sato et al.. Science 273:352^ 1996. Anoflier preferred adjuvant comprises a saponin. 
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such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila 
Biopharmaceuticals Inc., Framingham, MA); Escin; Digitonin; or Gypsophila or 
Chenopodium quinoa saponins . Other preferred formulations include more than one 
saponin in the adjuvant combinations of the present invention, for example 
5 combinations of at least two of the following group comprising QS21, QS7, Quil A, p- 
escin^ or digitoniiL 

Alternatively the saponin formulations may be combined with vaccine 
vehicles composed of chitosan or other poiycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamme-based polymer matrix, 

10 particles composed of polysaccharides or chemically modified polysaccharides, 
liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. The 
saponins may also be formulated in the presence of cholesterol to form particulate 
structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
together with a polyoxyethylene ether or ester, in either a non-particxilate solution or 

1 5 suspension, or in a particulate structure such as a paucilamelar liposome or ISCOM. The 
saponins may also be formulated with excipients such as Carbopol^ to increase 
viscosity, or may be formulated in a dry powder form with a powder excipient such as 
lactose. 

In one preferred embodiment, the adjuvant system includes the 
20 combination of a monophosphoryl Upid A and a saponin derivative, such as the 
combination of QS21 and 3D-MPL® adjuvant, as desmbed in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched with cholesterol, as described in 
WO 96/33739. Other prefeiited formulations comprise an oil-in-water emulsion and 
tocopherol. Anodier particularly preferred adjuvant formulation employing QS21, 3D- 
25 MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO 
95/17210. 

Another enhanced adjuvant system involves the combination of a CpG- 
containing oligonucleotide and a saponin derivative particularly the combination of 
CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally 
30 comprises an oil in water emulsion and tocopherol. 
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Additional illustrative adjuvants for use in the pharmaceutical 
compositions of the invention include Montanide ISA 720 (Seppic, France), SAF 
(Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series 
of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
5 Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, 
MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described 
m pending U.S. Patent AppUcation Serial Nos. 08/853,826 and 09/074,720, the 
disclosures of \^ch are incorporated h^in by reference in their entireties, and 
polyoxyethylene ether adjuvants such as those described in WO 99/52549A1. 
10 Other preferred adjuvants include adjuvant molecules of the general 

formula 

(I): HO(CH2CH20)n-A-R, 
wherein, n is 1-50, A is a bond or -C(0)-, R is C1.50 alkyl or Phenyl C1.50 alkyl 

One embodiment of the present invention consists of a vaccine 

15 formulation comprising a polyoxyethylene ether of general formula (I), wherein n is 
between 1 and 50, preferably 4-24, most preferably 9; the R component is C1.50, 
preferably C4-C20 alkyl and most preferably C12 alkyl, and ^ is a bond. The 
concentration of the polyoxyethylene ethers should be in the range 0.1-20%, preferably 
from 0.1-10%, and most prefwably in the range 0.1-1%. Preferred polyoxyethylene 

20 ethers are selected from the following group: polyoxyethylene-9-lauryl ether, 
polyoxyethylene-9-steoryl ether, polyoxyefliylene-8-steoryl ether, polyoxye1hylene-4- 
lamyl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 
Polyoxyethylene eth^s such as polyoxyethylene lauryl ether are described in the Merck 
index (12*^ edition: entry 7717). These adjuvant molecules are described in WO 

25 99/52549. 

The polyoxyethylene ether according to the general formula (I) above 
may, if desired, be combined with another adjuvant For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 
GB 9820956.2. 

30 According to anotiier embodiment of this invention, an immunogenic 

composition described herein is delivered to a host via antigen presenting cells (APCs), 
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such as dendritic cells, macrophages, B cells, monocytes and other cells that may he 
engineered to be eflBcient APCs. Such cells may, but need not, be genetically modified 
to increase the cq)acity for presenting the antigen, to improve activation and/or 
maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
5 immunologically compatible with the receiver (z.e., matched HLA haplotype). APCs 
may generally be isolated fix>m any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 
or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic 
10 cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 392:245-251 ^ 1998) and have been shown to 
be effective as a physiological adjuvant for elicitii^ prophylactic or therapeutic 
antitumor immunity (see Timmerman and Levy, Ann Rev, Med 50:507-529, 1999). In 
gen^, dendritic cells may be identified based on their typical shape (stellate in situ, 
IS with mariced cytoplasmic processes (dendrites) visible in vitro\ &eir ability to take up, 
process and present antigens with high efGciency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engmeered to express* specific cell- 
surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
vivo, and such modified dendritic cells aie contemplated by the present invention. As 
20 an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine (see Zitvogel et al.. Nature Med ^:594-600, 
1998). 

Dendritic cells and progenitors may be obtained &om peripheral blood, 
bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph 

25 nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid For 
example, dendritic cells may be differentiated ex vivo by adding a combination of 
Q^okines such as GM-CSF, IL-4, IL-13 and/or TNFa to cultures of monocytes 
harvested fix)m peripheral blood Alternatively, CD34 positive cells harvested fix)m 
peripheral blood, umbilical cord blood or bone marrow may be diflferentiated into 

30 dendritic cells by adding to tiie culture medium combinations of GM-CSF, IL-3, TNFcx, 
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CIMO lig^nd, LPS, flt3 ligand and/or other compoiind(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are convenientiy categorized as "immature" and "mature" 
cells, which allows a simple way to discriminate between two well characterized 
5 phenotypes. However, tMs nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 
characterized as APC witii a high capacity for antigen uptake and processing, which 
correlates with the high e7q)ression of Fey receptor and mannose receptor. The mature 
phenotype is typically characterized by a lower e;q)ression of these markers, but a high 

10 expression of ceU surface molecules responsible for T cell activation such as class I and 
class n MHC, adhesion molecules (e.g., CDS4 and CDll) and costimulatory molecules 
CD40, CD80, CD86 and 4-lBB). 

APCs may generally be transfected with a polynucleotide of the 
invention (or portion or other variant thereof) such that the encoded polypeptide, or an 

15 immunogenic portion thereof^ is expressed on the cell surface. Such transfection may 
take place ex vivo, and a pharmaceutical composition comprising such transfected cells 
may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
delivery vehicle that targets a dendritic or other antigen presenting ceil may be 
administered to a padent, resultmg in transfection that occurs in vivo. In vivo and ex 

20 vivo transfection of dendritic cells, for example, may generally be performed using any 
methods known in the art, such as those described in WO 97/24447, or Hie gene gun 
approach described by Msibvi et al.. Immunology and cell Biology 75:456-460, 1997, 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 
progenitor cells wifli the tumor polypeptide, DNA (naked or within a plasmid vector) or 

25 RNA; or with antigen-expressing recombinant bacterium or viruses {e.g., vaccinia, 
fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
covalently conjugated to an immunological partner that provides T cell help (e.g., a 
carrier moleciile). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
immunological partner, separately or in the presence of the polypeptide. 

30 Whileany suitable carrier known to tiiose of ordinary skill in the art may 

be employed in tiie pharmaceutical compositions of this invention, the type of carrier 
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will typically vary depending on the mode of adxninistmtioa Compositions of the 
present invention may be formulated for any appropriate manner of administration, 
including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 
intraperitoneal, subcutaneous and intramuscular administratioiL 
5 Carriers for use within such pharmaceutical compositions are 

biocompatible, and may also be biodegradable. In certain embodiments, the 
fonnulation preferably provides a relatively constant level of active component release. 
In other embodiments, however, a more rapid rate of release immediately upon 
administration may be desired. The formulation of such compositions is well within the 

10 level of ordinary sirill in the art using known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, 
starch, cellulose, dextran and the like. Other illustrative delayed-release carriers 
include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., 
a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 

15 comprising an amphiphilic compound, such as a phospholipid {see e,g., U.S. Patent No. 
5,151,254 and PCT appUcations WO 94/20078, WO/94/23701 and WO 96/06638). The 
amount of active compound contained within a sustained release formulation depends 
upon the site of implantation, the rate and expected duration of release and the nature of 
the condition to be treated or prevented. 

20 In another illustrative embodiment, biodegradable microspheres (e.g., 

polylactate polyglycolate) are employed as carriers for the compositions of this 
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 
Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
5,814,344, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems. 

25 such as described in WO/99 40934, and references cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 
comprising particulate-protein complexes, such as those described in U.S. Patent No. 
5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 
responses in a host 

30 The pharmaceutical compositions of the invention will often further 

comprise one or more buffers (e.g., neutral bufii^ed saline or phosphate buffered 
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saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 
polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating 
agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that 
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a 
5 recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

The pharmaceutical compositions described herein may be presented in 
unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
are typically sealed in such a way to preserve the sterility and stability of the 

10 formulation until use. In general, formulations may be stored as suspensions, solutions 
or emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
may be stored in a fieeze-dried condition requiring only the addition of a sterile liquid 
carrier immediately prior to use. 

The development of suitable dosing and treatment regimens for using the 

15 particular compositions described herein in a variety of treatment regimens, including 
e.g,, oral, parenteral, intravenous, intranasal, and intramuscular administration and 
formulation, is well known in the art, some of which are briefly discussed below for 
general purposes of illustration. 

In certain apphcations, the pharmaceutical compositions disclosed herein 

20 may be delivered via oral administration to an animal. As such, these compositions 
may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
tablets, or they may be incorporated directiy with the food of the diet 

The active compounds may even be incorporated with excipients and 

25 used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et al. Nature 
1997 Mar 27;386(6623):410-4; Hwang et al, Crit Rev Ther Drug Carrier Syst 
1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 

30 variety of additional components, for example, a binder, such as gum tragacanth, acacia, 
cornstarch, or gelatin; excipients, such as dicalcium pfhosphate; a disintegrating agent. 
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such as com starch, potato starch, alginic acid and the Uke; a lubricant, such as 
magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
5 materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify Hxe physical form of the dos^^e unit For instance, 
tablets, pills, or capsules may be coated wi& shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form shoidd be phannaceutically pure and 
substantially non-toxic in the amounts employed. In addition, the active compounds 

1 0 may be incorporated into sustained-release preparation and formulations. 

Typically, these formulations wiU contain at least about 0.1% of the 
active compoimd or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Naturally, the amoxmt of 

15 active compound(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any given unit dose of the compound 
Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 

20 variety of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients m the form of a mouthwash, 
dentifiice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Alternatively, the active ingredient may be incorporated into an oral solution such as 

25 one containing sodium borate, glycerin and potassixun bicarbonate, or dispersed in a 
dentifiice, or added in a therapexitically-eflFective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 
Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otiierwise dissolved in the mouth. 

30 In certain circumstances it will be desirable to deliver the pharmaceutical 

compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
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intraperitoneally. Such approaches aie well known to the skilled artisan, some of which 
are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 
as free base or pharmacologically acceptable salts may be prepared in water suitably 
5 mixed with a surfectant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 
Under ordinary conditions of storage and use, these preparations generally will contain a 
preservative to prevent the growth of microorganisms. 

Illustrative pharmaceutical forms suitable for injectable use include 

10 stmle aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 
5,466,468). In all cases the form must be sterile and must be fluid to the extent that 
easy syringability exists. It must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating action of microorganisms, 

15 such as bacteria and fimgi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol {e,g„ glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 
oils. Proper fluidity may be maintained, for example, by &e use of a coating, such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and/or 

20 by the use of sur&ctants. Hie prevention of the action of microorganisms can be 
facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, liumerosal, and the like. In many cases, it will be 
preferable to mclude isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 

25 the compositions of agents delaying absorption, for example, aluminum monostearate 
and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, 
the solution should be suitably buffered if necessary and the liquid diluent first rendered 
isotonic with suf5cient saline or glucose. These particular aqueous solutions are 
30 especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous mediimi that can be employed wiU 



76 



wo 02/39885 



PCTAJSOl/45395 



be known to those of skill in the art in light of the present disclosure. For example, one 
dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodennoclysis fluid or injected at the proposed site of infiision, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
5 1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human administration, preparations will of 
course preferably meet sterility, pyrogenicity, and the general safety and purity 
standards as required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed 

10 herein may be formulated in a neutral or salt form. Illustrative 
pharmaceutically-acceptable salts include the acid addition salts (formed with the firee 
amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 

15 derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
administered in a manner compatible with the dosage formxilation and in such amount 
as is therapeutically effective. 

20 The cairi^ can further comprise any and all solvents, dispersion media, 

vehicles, coatings, diluents, antibactoial and antifungal agents, isotonic and absorption 
delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the ait 
Except insofar as any conventional media or agent is incompatible with the active 

25 ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. The phrase 
"pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be 

30 delivered by intranasal sprays, inhalation, and/or other aerosol deliv^ vehicles. 
Methods for delivering genes, nucleic acids, and peptide compositions directly to the 
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lungs via nasal aerosol sprays has been described, e.g., in U. S. Patent 5,7565353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal microparticle 
resms (Takenaga et aL, J Controlled Release 1998 Mar 2;52(l-2):81-7) and 
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) are also well-known in 
5 the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of 
a polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

In certain embodimaits, liposomes, nanocapsules, microparticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
present invention into suitable host cells/organisms. In particular, the compositions of 

10 the present invention may be formulated for delivery either encapsulated in a lipid 
particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, 
compositions of the present invention can be bound, either covalently or non-covalently, 
to the surface of such carrier vehicles. 

The formation and use of liposome and liposome-like preparations as 

15 potential drug carriers is generally known to those of skill in the art (see for example, 
Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 
Mar;56(3):691-5; Chandran et al., Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, 
Crit Rev Iher Drug Carrier Syst 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 
Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 

20 5,795,587, each specifically incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that 
are normally difGcult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et al, J Biol Oiem. 1990 Sep 
25;265(27): 16337-42; Muller etal, DNA Cell Biol. 1990 Apr,9(3):221-9). hi addition, 

25 liposomes are fiee of the DNA length constraints that are typical of vual-based delivery 
systems. Liposomes have been used effectively to introduce genes, various drugs, 
radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
liposomes does not appear to be associated with autoimmune responses or unacceptable 

30 toxicity after systemic delivery. 
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In certain embodiments, liposomes are formed from phospholipids that 
are dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for 
S pharmaceutically-acceptable nanocapsule formulations of the compositions of the 
present invention. Nanocapsules can generally entrap compounds in a stable and 
reproducible way (see, for example, (Juintanar-Guerrero et cd.^ Drug Dev Ind Pharm. 
1998 Dec;24(12):l 113-28). To avoid side efiFects due to intracellular polymeric 
overloading, such ullrafine particles (sized around 0.1 pm) may be designed using 
10 polymers able to be degraded in vivo. Such particles can be made as described, for 
example, by Couvreur etal., Crit Rev Ther Drug Carrier Syst 1988;5(l):l-20; zur 
Muhlen et aU Eur J Pharm Biophami. 1998 Mar;45(2): 149-55; Zambaux et d. J 
Controlled Release. 1998 Jan 2;50(l-3):31-40; and U. S. Patent 5,145,684. 

Cancer Therapeutic Methods 

15 In further aspects of the present invention, the pharmaceutical 

compositions described herein may be used for the treatment of cancer, particularly for 
the immunotherapy of ovarian cancer. Within such methods, the pharmaceutical 
compositions described herein are administered to a patient, typically a warm-blooded 
animal, preferably a human. A patient may or may not be afflicted with cancer. 

20 Accordingly, the above pharmaceutical compositions may. be used to prevent the 
development of a cancer or to treat a patient afOicted with a cancer. Pharmaceutical 
. compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treatment such as administmtion of radiotherapy or 
conventional chemotherapeutic drugs. As discussed above, administration of the 

25 pharmaceutical compositions may be by any suitable method, including administration 
by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, 
anal, vaginal, topical and oral routes. 

Within certain embodiments, immunotherapy may be active 
immunotherapy, in which treatment relies on tiie in vivo stimulation of the endogenous 

30 host immune system to react against tumors with the administration of inmiune 
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response-modifying agents (such as polypeptides and polynucleotides as provided 
herein). 

Within other embodiments, inamxinotherapy may be passive 
mmiunother^y, in which treatment involves the delivery of agents with established 
5 tumor-immune reactivity (such as effector cells or antibodies) that can directly or 
indirectly mediate antitumor effects and does not necessarily depend on an intact host 
immune system. Examples of effector cells include T cells as discussed above, T 
lymphocytes (such as CD8"*" cytotoxic T lymphocytes and CD4"*" T-helper tumor- 
infiltraling lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 

10 activated killer cells), B cells and antigen*presenting cells (such as dendritic cells and 
macrophages) expressing a polypeptide provided herein. T ceU receptors and antibody 
receptors specific for the polypeptides recited herein may be cloned, expressed and 
transferred into other vectors or effector cells for adoptive immunotherapy. The 
polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 

15 antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 
immunotherapy. 

Effector cells may generally be obtained in sufficient quantities for 
adoptive immunotherapy by growth in viiro^ as described hereiiL Culture conditions for 
expanding single antigen-specific effector cells to sevml billion in number with 

20 retention of antigen recognition in vivo are well known in the art Such m vitro culture 
conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as TL-2) and non-dividing feeder cells. As noted above, 
immimoreactive polypeptides as provided herein may be used to rapidly e^q>and 
antigen-specific T cell cultures in order to generate a sufficient number of cells for 

25 immimotherapy. hi particular, antigen-presenting cells, such as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected Avith one or more polynucleotides using standard techniques well known 
in the art. For example, antigen-presenting cells can be transfected with a 
polynucleotide having a promoter appropriate for increasing expression in a 

30 recombinant virus or other e:q)ression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long term in vivo. Studies 
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have shown that cultured effector cells can be induced to grow in vivo and to survive 
long term in substantial numbers by repeated stimulation with antigen supplemented 
withIL-2 {see^ for example, Cheever et al., Ifnmunological Reviexvs i 57:177, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
5 introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 
intravenous^ intracavitary, intraperitoneal or intiatumor admmistration. 

Routes and frequency of administration of the ther^utic compositions 
10 described herein, as well as dosage, will vary from individual to individual, and may be 
readily established using standard techniques. In general, the pharmaceutical 
compositions and vaccines may be administered by injection (e.^., intracutaneous, 
intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
15 Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 
individual patients. A suitable dose is an amount of a compound that, when 
administered as described above, is capable of promoting an anti-tumor immune 
refuse, and is at least 10-50% above the basal (ie., untreated) level. Such response 
20 can be monitored by measuring Ifae anti-tumor antibo(£es in a patient or by vaccine- 
dependent generation of cytolytic effector cells capable of killing the patient's tumor 
cells in vitro. Such vaccines should also be capable of causing an immune response that 
leads to an improved clinical outcome (e.g., more frequent remissions, complete or 
partial or longer disease-free survival) in vaccinated patients as compared to non- 
25 vaccinated patients. In general, for pharmaceutical compositions and vaccines 
comprising one or more polypeptides, the amoimt of each polypeptide present in a dose 
ranges from about 25 jig to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0.1 mL to about 5 mL. 

In general, an appropriate dosage and treatment regimen provides the 
30 active compound(s) in an amount sufficient to provide fhers^utic and/or prophylactic 
benefit Such a response can be monitored by establishing an improved clinical 
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outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
survival) in treated patients as compared to non-treated patients. Increases in 
preexisting immune responses to a tumor protein generally correlate with an improved 
clinical' outcome. Such immune responses may generally be evaluated using standard 
5 proliferation, cytotoxicity or cytokine assays, which may be pCTformed using samples 
obtained from a patient befr>re and after treatment 

Cancer Detection and Diagnostic Compositions, Methods and Kits 

In general, a cancer may be detected in a patient based on the presence of 
one or more ovarian tumor proteins and/or polynucleotides encoding such proteins in a 

10 biolo^cal sample (for example, blood, sera, sputum urine and/or tumor biopsies) 
obtained from the patient In other words, such proteins may be used as markers to 
indicate the presence or absence of a cancer such as ovarian cancer. In addition, such 
proteins may be useful for the detection of other cancers. The binding agents provided 
herein generally permit detection of the level of antigen that binds to the agent in the 

15 biological sample. Polynucleotide primers and probes may be used to detect the level of 
mRNA encoding an ovarian tumor protein, which is also indicative of the presence or 
absence of a cancer. In general, a ovarian tumor sequence should be present at a level 
that is at least three fold higher in tumor tissue than in normal tissue 

There are a variety of assay formats known to those of ordinary skill in 

20 the art for using a binding agent to detect polypeptide markers in a sample. See^ e.g.y 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. In general, the presence or absaice of a cancer in a patient may be determined by 
(a) contacting a biological sample obtained from a patient with a binding agent; (b) 
detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 

25 comparing the level of polypeptide with a predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of binding agent 
immobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a detection 
reagent that contains a reports group and specifically binds to the binding 

30 agent/polypeptide complex. Such detection reagmts may comprise, for example, a 
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binding agent that specifically binds to the polypeptide or an antibody or oQi&r agent 
that specifically binds to the binding agent, such as an anti-immunoglobiilin, protein G, 
protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 
5 binding agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide to the 
binding agent is indicative of the reactivity of the sample with the immobilized binding 
agent Suitable polypeptides for use within such assays include full length ovarian 
tumor proteins and polypeptide portions fliereof to vMch the binding agent binds, as 

10 described above. 

The solid support may be any material known to those of ordinary skill 
in the art to which the tumor protein may be attached. For example, the solid support 
may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 

15 plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 
using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. In &e context of the present invention, 

20 the term "immobilization" refers to both noncovalent association, such as adsorption, 
and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking agent), 
hnmobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in 

25 a sxiitable bxiffer, with the solid support for a suitable amoimt of time. The contact time 
varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or 
polyvinylchloride) with an amoimt of binding agent ranging from about 10 ng to about 
10 ^g, and preferably about 100 ng to about 1 ^g, is sufiBcient to immobilize an 

30 adequate amount of binding agent 
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Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifimctional reagent diat will react with 
both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent For example, the bindmg agent may be covalently attached to supports 
5 having an appropriate polymer coating using benzoquinone or by condensation of an 
aldehyde group on the support with an amine and an active hydrogen on the binding 
partner (see, e.g.. Pierce Immunotechnology Catalog and Handbook, 1991, at 
A12-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. 

10 This assay may be performed by first contacting an antibody that has been immobilized 
on a solid siq)port, commonly the well of a mia*otiter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibo(fy. 
Unboimd sample is then removed firom the immobilized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody capable of binding to a 

15 different site on the polypeptide) containing a reporter group is added. The amoimt of 
detection reagent that remains bound to the solid support is then detemained using a 
method appropriate for the specific reporter group. 

More specifically, once the antibody is inmiobilized on the support as 
described above, the remaining protein binding sites on the support are typically 

20 blocked. Any suitable blocldng agent laiown to those of ordinary skin in the 

bovine serum albumin or Tween 20™ (Sigma Chemical Co., St Louis, MO). The 
unmobilized antibody is then incubated with the sample, and polypeptide is allowed to 
bind to tibie antibody. The sample may be diluted with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to mcubation. hi genial, an appropriate contact 

25 tune (z.e., incubation time) is a period of time that is sufficient to detect the presence of 
polypeptide within a sample obtained fi-om an individual with ovarian cancer. 
Preferably, the contact time is sufficient to achieve a level of binding that is at least 
about 95% of that achieved at equilibrium between bound and unbound polypeptide. 
Those of ordinary skill in the art will recognize that the time necessary to achieve 

30 equilibrium xxiay be readily determined by assaying the level of binding that occurs ov^ 
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a period of time. At room temperature, an incubation time of about 30 minutes is 
generally suflBcient. 

Unbound sample may then be removed by washing the solid support 
with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
5 antibody, which contains a reporter group, may then be added to the solid support 
Preferred reporter groups include those grou5)s recited above. 

The detection reagent is then incubated with the immobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 
An appropriate amount of time may generally be determined by assaying the level of 

10 binding that occurs over a period of time. Unbound detection reagent is then removed 
and bound detection reagent is detected using the reporter groiqp. The method employed 
for detecting the reporter group depends upon the nature of ttie reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be xised to detect dyes, liuninescent groups 

15 and fluorescent groups. Biotin may be detected using avidin, coupled to a different 
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

20 To detemiine the presence or absence of a cancer, such as ovarian 

cancer, the signal detected from the reporter group that remains bound to the solid 
support is generally compared to a signal that corresponds to a predetemiined cut-off 
value. In one preferred embodiment, the cut-off value for the detection of a cancer is 
the average mean signal obtained when the immobilized antibody is incubated with 

25 samples from patients without the cancer. In general, a sample generating a signal that 
is three standard deviations above tiie predetermined cut-off value is considered positive 
for the cancer. In an alternate preferred embodiment, the cut-off value is determined 
using a Receiver Operator Curve, according to the method of Sackett et al.. Clinical 
Epidemiology: A Basic Science for Clinical Medicine^ Littie Brown and Co., 1985, 

30 p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot 
of pairs of true positive rates (ie., sensitivity) and felse positive rates (100%-specificity) 
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that correspond to each possible cut-off value for ttie diagnostic test result The cut-off 
value on the plot that is the closest to the upper left-hand comer (/.e., the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
signal that is higher than the cut-off value determined by this method may be considered 
5 positive. Alternatively, the cut-off value may be shifted to the left along the plot, to 
minimize the felse positive rate, or to the right, to minimize the false negative rate, hi 
general, a sample generating a signal that is higher than the cut-off value determined by 
this method is considered positive for a cancer. 

In a related embodiment, the assay is performed in a flow-through or 

10 strip test format, wherein the binding agent is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the mmbrane. A second, 
labeled binding agent flien binds to the binding agent-polypeptide complex as a solution 
containing the second binding agent flows through the membrane. The detection of 

15 bound second binding ^ent may then be performed as described above. In the strip test 
format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
region containing second binding agent and to the area of immobilized binding agent 
Concentration of second bmding agent at the area of immobilized antibocfy indicates the 

20 presence of a cancer. Typically, the concentration of second binding agent at that site 
generates a pattern, such as a line, that can be read visually. The absence of such a 
pattern indicates a negative result In general, the amoimt of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattern when the 
biological sample contains a level of polypeptide that would be sufficient to generate a 

25 positive signal in flie two-antibody sandwich assay, in the format discussed above. 
Preferred binding agents for use in such assays are antibodies and antigen-binding 
fragments thereof Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about Ijig, and more preferably from about 50 ng to about 
500 ng. Such tests can typically be performed with a very small amount of biological 

30 sample. 
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Of coxirse, numerous other assay protocols exist that are suitable for use 
with the tumor proteins or binding agents of ihe present invention. The above 
descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may be readily modified to use 
5 tumor polypeptides to detect antibodies that bind to such polypeptides in a biological 
sample. The detection of such tumor protein specific antibodies may correlate with the 
presence of a cancer. 

A cancer m^ also, or alternatively, be detected based on the presence of 
T cells that specifically react witii a tumor protein in a biological sample. Within 

10 certain methods, a biological sample comprising €04*^ and/or CDS*^ T cells isolated 
firom a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a 
polypeptide and/or an APC that expresses at least an immimogenic portion of such a 
polypeptide, and the presence or absence of specific activation of the T cells is detected. 
Suitable biological samples include, but are not limited to, isolated T cells. For 

15 example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifiigation of peripheral blood lymphocytes). T 
cells may be incubated in vitro for 2-9 days (typically 4 days) at 3TC with polypeptide 
(e.g., 5 - 25 ^g/ml). It may be desimble to incubate another aliquot of a T cell sample in 
the absence of tumor polypeptide to serve as a control. For CD4^ T cells, activation is 

20 preferably detected by evaluating proliferation of the T cells. For CD8* T cells, 
activation is preferably detected by evaluating cytolytic activity. A level of proliferation 
that is at least two fold greater and/or a level of (^olytic activity that is at least 20% 
greater than in disease-fi:ee patients indicates the presence of a cancer in the patient 

As noted above, a cancer may also, or alternatively, be detected based on 

25 the level of mKNA encoding a tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 
(PCR) based assay to amplify a portion of a tumor cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for {Le., 
hybridizes to) a polynucleotide encoding the tumor proteirL The amplified cDNA is 

30 then separated and detected using techniques well known in tiie art, such as gel 
electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a 



87 



wo 02/39885 



PCT/USOl/45395 



polynucleotide encoding a tumor protein may be used in a hybridization assay to detect 
the presence of polynucleotide encoding the tumor protein in a biological sample. 

To permit hybridization under assay conditions, oligonucleotide primers 
and probes should comprise an oligonucleotide sequence that has at least about 60%, 
5 preferably at least about 75% and more preferably at least about 90%, identity to a 
portion of a polynucleotide encoding a tumor protein of the invention that is at least 10 
nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a 
polypeptide described herein under moderately stringent conditions, as defined above. 

10 Oligonucleotide primers and/or probes which may be usefully employed in the 
diagnostic methods described herem preferably are at least 10-40 nucleotides in length. 
In a preferred embodunent, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 
having a sequence as disclosed herein. Techniques for both PGR based assays and 

15 hybridization assays are well known in the art (see^ for example, Mullis et al.. Cold 
Spring Harbor Symp. Quant. Biol, 51:263, 1987; Erlich ed., PCR Technology, Stockton 
Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied in 
conjunction with reverse transcription. Typically, RNA is extracted fiom a biological 

20 sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PCR amplification using at least one specific primer generates a cDNA molecule, whidi 
may be separated and visualized using, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken jQrom a test patient and fix>m an 
individual who is not afflicted with a cancer. The amplification reaction may be 

25 performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold 
or greater increase in expression in several dilutions of the test patient sample as 
compared to the same dilutions of the non-cancerous sample is typically considered 
positive. 

In anotfao: embodiment, the compositions described herein may be used 
30 as markers for the progression of cancer. In Ihis embodimrat, assays as described above 
for the diagnosis of a cancer may be performed over time, and the change in the level of 
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reactive poIypeptide(s) or polyQucleotide(s) evaluated. For example, the assays may be 
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 
paf oxmed as needed. In general, a cancer is progressing in those patients in whom the 
level of polypeptide or polynucleotide detected increases over time. In contrast, the 
5 cancer is not progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 

Certain in vivo diagnostic assays may be performed directly on a tumor. 
One such assay involves contacting tumor cells vriith a binding agent The bound 
binding agent may thra be detected directly or indirectly via a reports group. Such 

10 binding agents may also be used in histological plications. Alternatively, 
polynucleotide probes may be used within such applications. 

As noted above, to improve sensitivity, multiple tumor protein markers 
may be assayed within a given sample. It will be apparent that binding agents specific 
for different proteins provided herein may be combined within a single assay. Further, 

15 multiple primers or probes may be used concurrently. The selection of tumor protein 
markers may be based on routine experiments to determine combinations that results in 
optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided 
herein may be combined with assays for other known tumor antigens. 

The present invention further provides kits for use within any of the 

20 above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, reagents, 
containers and/or equipment For example, one container within a kit may contain a 
monoclonal antibody or fragment thereof that specifically binds to a tumor protein. 
Such antibodies or firagments may be provided attached to a support material, as 

25 described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain 
a detection reagent as described above that contains a reporter group suitable for direct 
or indirect detection of antibody binding. 

Alternatively, a kit may be designed to detect the level of mRNA 

30 encoding a tumor protein in a biolo^cal sample. Sudi kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 
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polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for 
example, within a PGR or hybridization assay. Additional components that may be 
present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to fecilitate the detection of a polynucleotide encoding a tumor protein. 
5 The following Examples are offered by way of illustration and not by 

way of limitation. 

EXAMPLES 
EXAMPLE 1 

10 IbENTinCATION OF REPRESENTATIVE OVARIAN CARCINOMA CDNA SEQUENCES 

Primary ovarian tumor and metastatic ovarian tumor cDNA libraries 
were each constructed in kanamycin resistant pZErO™-2 vector (Invitrogen) from pools 
of three different ovarian tumor RNA samples. For the primary ovarian tumor library, 
the following RNA samples were used: (1) a moderately differentiated papillary serous 

15 carcinoma of a 41 year old, (2) a stage inc ovarian tumor and (3) a papillary serous 
adenocarcinoma for a SO year old Caucasian. For the metastatic ovarian tumor library, 
the RNA samples used were omentum tissue from: (1) a metastatic poorly 
differentiated papillary adenocarcinoma wi& psammoma bodies in a 73 year old, (2) a 
metastatic poorly differentiated adenocarcinoma in a 74 year old and (3) a metastatic 

20 poorly differentiated papillary adenocarcinoma in a 68 year old. 

The number of clones in each libiaiy was estimated by plating serial 
dilutions of unamplified libraries. Insert data were detemimed from 32 primary ovarian 
tumor clones and 32 metastatic ovarian tumor clones. The library characterization 
results are shown in Table L 

25 
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Table I 

Characterization of cDNA Libraries 



Library 


# Clones 
in Library 


Clones with 
Insert (%) 


Insert Size 
Range (bp) 


Ave. Insert 
Size(bp}^ 


Primary Ovarian Tumor 


1^58,000 


97 


175 - 8000 


2356 


Metastatic Ovarian 
Tiimor 


1,788,000 


100 


150 - 4300 


1755 



Four subtraction libraries were constructed in ampicillin resistant 
pcDNA3.1 vector (Invitrogen). Two of the libraries were fix>m primaiy ovarian tumors 
and two were fit)m metastatic ovarian ttimors. In each case, the number of restriction 
enzyme cuts within inserts was minimized to generate full length subtraction libraries. 
10 The subtractions were each done witihi slightiy different protocols, as described in more 
detail below. 



A. POTS 2 Library: Primary Ovarian Tumor Subtraction Library 

Tracer: 1 0 jig primary ovarian tumor library, digested with Not I 

15 Driver: 35 ng normal pancreas in pcDNA3.1(+) 

20 (xg normal PBMC in pcDNA3.1(+) 
10 ^g normal skin in pcDNA3.1(+) 
35 |ig normal bone marrow in pZErO™-2 
Digested vn± Bam Hl/Xho I/Sca I 
20 Two hybridizations were performed, and Not I-cut pcDNAS. 1(+) was the 

cloning vector for the subtracted library. Sequence results for previously unidentified 
clones that were randomly picked from the subtracted library are presented in Table 11. 



25 



Table n 



Sequence 


SEQIDNO 


21907 


1 


21909 


2 


21911 


5 


21920 


9 


21921 


10 
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OC^UCilvv 


SFO ID NO 








1 AA 








l*rO 


ZDl 1 1 






1 AO 






25116 


151 


25752 


156 


25757 


158 


25763 


160 


25769 


161 


25770 


162 



B. POTS 7 Library: Primary Ovarian Tumor SubtractioD Library 

Tracer: 1 0 primary ovarian tumor library, digested with Not I 

5 Driver: 35 fig normal pancreas in pcDNA3. 1 (+) 

20 ^ig normal PBMC in pcDNA3.1(+) 
10 |ig normal skin in pcDNA3.1(+) 
35 \xg normal bone marrow in pZErO™-2 
Digested with Bam Hl/Xho I/Sca I 
10 -25 ngpZEiO^-2, digested with Bam ffl and Xho I 

Two hybridizations were performed, and Not I-cut pcDNA3.1(+) was the 
cloning vector for the subtracted library. Sequence results for previously unidentified 
clones that were randomly picked fiom the subtracted library are presented m Table m. 

15 Table m 

Ovarian Carcinoma Sequences 



Sequence 


SEQroNO 


24937 


125 


24940 


128 


24946 


132 


24950 


133 


24951 


134 


24955 


136 


24956 


137 


25791 


166 
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Sequence 


SEQIDNO 


25796 


167 


25797 


168 


25804 


171 



C. OS 1 D Library: Metastatic Ovarian Tumor Subtraction Library 

Tracer: 1 0|ag metastatic ovarian library in pZErO™-2, digested 
5 with Not I 

^ Driven 24.Sfig normal pancreas in pcDNA3 . 1 
14\ig normal PBMC in pcDNA3.1 
14^g normal skin in pcDNA3.1 
24.5fig normal bone marrow in pZErO™-2 
10 SO^ig pZErO^-2, digested with Bam Hl/Xho I/Sfii I 

Three hybridizations were performed, and the last two hybridizations 
were done with an additional 15|xg of biotinylated pZErO™-2 to remove contaminating 
p2ErO™-2 vectors. The cloning vector for the subtracted Ubrary was pcDNA3.1/Not I 
cut. Sequence results for previously unidentified clones that were randomly picked 
15 from the subtracted library are presented in Table IV. 



Table IV 
Ovarian Carcinoma Sequences 



Sequence 


SEQIDNO 


23645.1 


13 


23660.1 


16 


23666.1 


19 


23679.1 


23 


24635 


57 


24647 


63 


24651 


65 


24661 


69 


24663 


70 


24664 


71 


24670 


72 


24675 


75 


24683 


78 
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D. OS IF Library: Metastatic Ovarian Tumor Subtraction Library 

Tracer: 1 Ofig metastatic ovarian tumor libraiy, digested with Not I 
Driven 12.8|^ normal pancreas in pcDNA3.1 
5 7.3|jg normal PBMC in pcDNA3.1 

7.3^g normal skin in pcDNA3.1 
12.8jig normal bone marrow in pZErO™-2 
25\ig pZErO™-2, digested with Bam Hl/Xho I/Sfu I 
One hybridization was performed The cloning vector for the subtracted 
10 library was pcDNA3.1/Not I cut Sequence results for previously unidentified clones 
that were randomly picked from the subtracted library are presented in Table V. 

Table V 



Ovarian Carcinoma Sequences 



Sequence 


SEQDDNO 


24336 (79% with H. sapiens mitochondrial genome 
(consensus sequence)) 


27 


24337 


28 


24341 (91%Homo sapiens chromosome S, BAC clone 
249h5(LBNLH149) 


32 


24344 


33 


24348 


35 


24351 


38 


24355 (91% Homo spirals chromosome 17, clone 
hCIT.91_J_4) 


41 


24356 


42 


24357 (87% S. scrofa mKNA for UDP glucose 
pyrophosphoiylase) 


43 


24358 


44 


24359 (78% Human mSNA for KIAAOl 1 1 gene, 
complete cds) 


45 


24360 


46 


24361 


47 


24362 (88% Homo sapiois Chromosome 16 BAC clone 
CIT987SK-A-233A7) 


48 


24363 (87% Homo sapiens eukaryotic translation 
elongation fector 1 alpha 1 CEEFlAl) 


49 


24364 (89%Human DNA sequence fix>m PAC 27K14 on 
chromosome Xpl 1 .3-Xpl 1 .4) 


50 


24367 (89% Homo sapiens 12pl33 BAC RCPIl 1- 


52 
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SFO TD NO 


935C2) 




24368 




24o9U 


01 


24692 


82 


24694 


Oil 

84 


24696 


86 


24699 


89 


24701 


90 


24703 


91 


24704 (88% Homo sapiens chromosome 9, clone 
nKPK.401 G_l 8) 


92 


24705 


93 


24707 


95 


24709 


97 


24711 


98 


24713 


99 


24714 (91%Hmnan DNA sequence from clone 125N5 


100 


on chromosome 6q26-27) 




24717 (89% Homo sapiens proliferation-associated gene 
A (natural Kuier-eiuiancing lactor Aj (rAvjAj 


103 


24727 


107 


24732 


111 


24737 (84%Human ADP/ATP translocase mKNA) 


114 


24741 


117 


24745 


120 


24746 


121 



The sequences m Table VI, which correspond to known sequences, were 
also identified in the above libraries. 

5 

Table VI 
Ovarian Carcinoma Sequences 



Identity 


SEQIDNO 


Sequence 


Library 


H.sapiens DNA for muscle nicotimc acetylcholine 
receptor gene promotor, clone ICRFcl05F02104 


3 


21910 


P0TS2 


Homo salens complement component 3 (C3) gene, 
exons 1-30. 


4 


21913 


P0TS2 


Homo s£5>iens SWI/SNF related, matrix associated, 
actin dependent regulator of chromatin, subfamily a, 
member 4 (SMARCA4) 


6 


21914 


P0TS2 


Human ferritin Heavy subunit mRNA, complete cds. 


7 


21915 


P0TS2 
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Identity 


SEQ ID NO 


Sequence 


Library 


Homo sapiens CGI-151 protein mRNA, complete cds 


o 

8 


21916 


P0TS2 


Human BAG clone GS055K18 from 7pl5-p21 


11 


23636.1 


OSID 


HUMGFIBPA Human growth honnone-dependent 
insulin-like growth jfactor-binding protein 


12 


23637.1 


OSID 


Homo salens nDosomal protein, large, PO (RPLPO) 
mRNA 


14 


23647.1 


OSID 


HUMlKJrMzAHunian lKi^M-2 rnKNA 


15 


23657.1 


/^C 1 

Ol>1D 


HUMMTA Homo sapiens mitochondrial DNA 


17 


23661.1 


OSID 


HSU78095 Homo sapiens placental bikunm mRNA 


18 


23662.1 


OSID 


HUMTI227HC Human mRNA for TI-227H 


20 


23669.1 


OSID 


HUMMTCG Human mitochondrion 


21 


23673.1 


OSID 


Homo sapiens FK506-binding protein lA (12kD) 
(FKBPlA)mKNA 


22 


23677.1 


OSID 


Homo sapiens mRNA for zinc-jQnger DNA-binding 
protein, complete cds 


24 


24333 


OSIF 


Homo sapiens mRNA; cDNA DKFZp564E1962 
(from clone DKFZp564E1962) 


25 


24334 


OSIF 


Homo sapiens tumor protein, translationally- 
controlled 1 (TPTl) mRNA. 


26 


24335 


OSIF 


Homo sapiens interleukin 1 receptor accessory 
protein (DLIRAP) mRNA. 


29 


24338 


OSIF 


Human mRNA for KIAA0026 gene 


30 


24339 


OSIF 


Homo sapiens K-Cl cotransporter KCC4 mRNA, 
complete cds 


31 


24340 


OSIF 


Homo sapiens nuclear chloride ion channel protein 
0^CC27) mRNA 


34 


24345 


OSIF 


Homo ss^iens mRNA for DEPP (decidual protein 
mduced by progesterone) 


36 


24349 


OSIF 


Homo sapiens atrophin-1 mteractmg protem 4 (AIP4) 
mRNA 


37 


24350 


OSIF 


Human collagenase type IV mRNA, 3* end. 


39 


24352 


OSIF 


riuman mKiN A tor i -cell cyclopnilm 


Ai\ 
4U 


24354 


Ublr 


Homo sapiens tumor suppressing subtransferable 
canoiaate i (^looUij 


51 


24366 


OSIF 


Homo sapiens clone 24452 mRNA sequence 


M 


24374 


Ublr 


Homo s^iens eukaryotic translation elongation £ictor 
1 aipna l (liiirlAlj 


55 


24627 


OSID 


Genomic sequence from Human 9q34 


56 


24634 


OSID 


Human insulin-like growth £ictor-binding protein-3 

gene 


58 


24636 


OSID 


Human ribosomal protein L3 mRNA, 3' end 


59 


24638 


OSID 


Homo sapiens annexin JI (lipocortin II) (ANX2) 
mRNA 


60 


24640 


OSID 


Homo sapiens tubulin, alpha, ubiquitous (K-ALPHA- 

1) 


61 


24642 


OSID 
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laenuty 


SEQ ID NO 


Sequence 


Library 


Human non-histone chromosomal protein HMG-14 

mKJNA 


62 


24645 


OSID 


Homo sapiens ferritin, heavy polypeptide 1 (FTHl) 


o4 


24o4o 


OSID 


rlomo sapiens Izpli.i rAC KrCll-yoJdIi/ (Kosweli 
r anc uancer msiiiuie riuman r ACJLiDraiyj 


00 


24553 


OSID • 


Jtiomo sapiens i ceu-specmc i3rrosme Kinase mKJNA 


0/ 


z4o!>j 




iiomo salens Kerann i o \^jsJvI i oj mKiN a 


Oo 


24o2>/ 




iiomo sapiens gro wm arresi specinc iranscnpx j gene 


ID 


z4o/l 




Homo sapiens ribosomal protein S7 (RPS7) 


74 


24673 


OSID 


nemo sapiens mKIMA; cUjnA DKrZpjo4Hlo2 


70 


24677 


OSID 


Human IbC-zz protem mKNA 


11 


24679 


/AC 1 TPV 

OSID 


Human mRNA for ribosomal protein 


79 


24687 


OSID 


Genonuc sequence uom Human 13 


80 


24689 


OSIF 


Homo sapiens clone IMAGE 286356 


83 


24693 


OSIF 


Homo sapiens v-fos FB J munne osteosarcoma viral 

1 . _ 1 ^ ^/Tr»^~\ \ -VTA 

oncogene homolog(FOS) mRNA 


85 


24695 


OSIF 


Homo sapiens hypothetical 43.2 Kd protem mRNA 


87 


24697 


OSIF 


Human heat shock protein 27 (HSPBl) gene exons 1- 
3 


88 


24698 


OSIF 


Homo sapiens senescence-associated epitnehal 
membrane protem (SEMrl) 


94 


24706 


OSIF 


Human ferritin H chain mRNA 


96 


24708 


OSIF 


TT ; T>^ T A T^T A A 0^7 

Homo sapiens mKNA for KIAA0287 gene 


101 


24715 


OSIF 


Homo sapiens CGI-08 protem mRNA 


102 


24716 


OSIF 


H.sapiens CpG island DNA genomic Msel fragment, 
clone 84aS 


104 


24719 


OSIF 


iiuman clone Zd /zz mKNA 


1 AC 


24721 


OSIF 


Homo sapiens zinc finger protein slug (SLUG) gene 


lUo 


24722 




Homo sapiens (clone L6) E-cadherin (CDHl) gene 


108 


24728 


OSIF 


Homo sapiens nbosomal protem LI 3 (RPL13) 


109 


24729 


OSIF 


H.sapieDS KNA tor snRNP protein B 


110 


24730 


OSIF 


Homo sapiens mKNA; cDNA DKFZp434K114 


112 


24734 


OSIF 


Homo sapiens cormchon protem mRNA 


113 


24735 


OSIF 


TT^ • t J.* n /TVT^nnoN t^'v t a 

Homo sapiens keratm 8 (KRT8) mRNA 


115 


24739 


OSIF 


Human DNA sequence from PAC 29K1 on 
chromosome 6p2 1.3-22.2. 


116 


24740 


OSIF 


Homo sapiens mRNA for KIAA0762 protein 


118 


24742 


OSIF 


Human clones 23667 and 23775 zmc finger protem 

mJKJNA 


119 


24744 


OSIF 


Human HI 9 RNA gene, complete cds. 


122 


24933 


P0TS7 


Human triosephosphate isomerase mRNA, complete 
cds. 


123 


24934 


P0TS7 


Human cyclooxygenase-1 (PTSGl) mRNA, partial 
cds 


124 


24935 


P0TS7 


Homo sapiens megakaryocyte potentiating factor 


126 


24938 


P0TS7 
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Identity 


SEQDNO 


Sequence 


Library 


^IVlrr j mKJNA. 








rlxmian mKiNA lor Apoljnluman (jyiJbiO(Aopl- 
iVL0uscj*jiiKe pioiciDj, compieie cos 


12/ 


24yjy 


rUlo/ 


Homo sapiens aiylacetamide deacetylase (esterase) 
/'A A'^A^*^ wiPxta 

{J\J\UJ\\^) niKJNA. 


129 


24942 


P0TS7 


xiomo sapiens ecninoaenii nucrotuDiue-associaiea 
protein-like EMAP2 mRNA, complete cds 




24y4j 


rUlo/ 


xiomo Sapiens poaocoiyxm-iiKe \ryjiJ.Aj^) mKiNA, 


111 




POTQ7 

JrVy lo / 


Homo sapiens synaptogyrin 2 (SYNGR2) mRNA. 


135 


24952 


P0TS7 


Homo sapiens amyloid beta precursor protein-binding 
protein 1, 59kD (APPBPl) mRNA. 


loo 


JAyOy 


rUlo7 


Human aldose reductase mRNA, complete cds. 


\iy 




rUlo/ 


Genomic sequence from Human 9q34, complete 
sequence [Homo sapi^] 


140 


25092 


P0TS2 


Human glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH) mRNA, complete cds. 


141 


25093 


P0TS2 


Homo sapiens breast cancer suppressor candidate 1 
(DCSC-1) mRNA, complete cds 


142 


25098 


P0TS2 


Homo sapiens SKBl (S. cerevisiae) homolog (SKBl) 

mRNA. 


147 


25110 


P0TS2 


Homo sapiens prepro dipeptidyl peptidase I (DPP-I) 
gene, complete cds 


152 


25117 


P0TS2 


Homo sapiens preferentially expressed antigen of 
melanoma (PRAME) mRNA 


153 


25745 


POTS2 


Human translocated t(8;14) c-myc (MYC) oncogene, 
exon 3 and complete cds 


154 


25746 


r(JTo2 


Human 12S RNA mduced by poly(rI), poIy(rC) and 
Newcastle disease virus 


155 


25749 


POTS2 


Human mRNA for fibronectin (FN precursor) 


157 


25755 


r(jro2 


Homo sapiens mRNA for hepatocyte growth fector 
activator inhibitor type 25Complete cds 


159 


25758 


P0TS2 


Homo sapiens mRNA lor K1AAU352 protein, 
complete cds 


163 


OCT '71 

25111 


rOTS7 


Homo sapiens IMP (inosine monophosphate) 
dehydrogenase 2 (IMPDH2) mRNA 


io4 


25775 


rKJihl 


Homo sapiens clone 235^42 alpna enolase mRNA, 


lOJ 


25/57 


rUlb/ 


H.sapiens vegf geaie, 3*UTR 


169 


25799 


P0TS7 


Homo sapiens 30S ribosomal protein S7 homolog 
mRNA, complete cds 


170 


25802 


P0TS7 


Homo sapiens acetyl-Coenzyme A acetyltransferase 2 
(acetoacetyl Coenzyme A thiolase) (ACAT2) mRNA 


172 


25808 


P0TS7 


Homo sapiens Nonie disease protein (NDP) mRNA 


173 


25809 


P0TS7 
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Still further ovarian carcinoma polynucleotide and/or polypeptide 
sequences identified firom the above libaries are provided below m Table Vn. 
Sequences 0574S (SEQ ID NO: 183 & 185), 0584S (SEQ ID NO:193) and 0585S 
(SEQ ID NO: 194) represent novel sequences. The remaining sequences exhibited at 
S least some homology with known genomic and/or EST sequences. 



Table vn 



SEQ ID: 




Library 

jLJm/x CSX y 


174 

1 /*T 




0565S CRABP 


OSID 


17S 
1 ij 




0566S Cenilonlasmin 


P0TS2 


X / □ 




0567S 41191 SEOfl>48T> 


P0TS2 


177 

III 




0568S KIAA0762 seaf 1>3999'> 


P0TS7 


1 /o 




0569S 41220 seafl>1069'fc 


P0TS7 


179 




O570S 41215 sean>1817> 

V/^ / viJ ^X^X*/*OVlJ^^X'^ XUX / J 


POTS2 

X V/ X 


loU. 




0571S_41213.seq(l>2382) 


P0TS2 


181 




0572S 41208 sea('l>2377'» 


POTS2 


182 




O573S_41177.seq(l>1370) 


OSIF 


183 






n/fl 


184 




0568SA^SGFDNAseq 


n/a 


185: 




0574S 47807 sean>3000'> 


n/a 


186: 




0568SA^SGF protein seq 


n/a 


187 




449H1(57581) 


OSID 


188: 




451E12(57582) 


OSID 


189 




453C7 3'(57583.1)Osteonectin 


OSID 


190 




453C7 5'(57583.2) 


OSID 


191: 




456G1 3'(57584.1)Neurotensin 


OSIF 


192: 




456G1 5'(57584.2) 


OSIF 


193: 




0584S_^465G5(57585) 


OSIF 


194: 




0585S 469612(57586) 


P0TS2 


195: 




0569S_474C3(57587) 


P0TS7 


196: 




483B1 3'(24934.1)Triosephosphate 


P0TS7 


197: 




57885 Human preferentially 
expressed antigen of melanoma 


P0TS2 


198: 


57886 Chromosome 22ql2.1 clone 

CTA-723E4 


P0TS2 


199: 


57887 Homologous to mouse brain 
cDNA clone MNCb-0671 


P0TS2 



10 Further studies on the clone of SEQ ID NO: 182 (also referred to as 

0573 S) led to the identification of multiple open reading fiames that encode the amino 
acid sequences of SEQ ID NO:200-202. 
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EXAMPLE2 

Analysis of cDNA Expression Using Microarray Technology 

In additional studies, sequences disclosed herein were found to be 
overexpressed in specific tumor tissues as determined by microarray analysis. Using 
5 this ^proach, cDNA sequences are PGR amplified and their mRNA expression profiles 
in tumor and normal tissues are examined using cDNA microarray technology 
essentially as described (Shena et al., 1995). hi brief, the clones are arrayed onto glass 
slides as multiple replicas, with each location correspondmg to a unique cDNA clone 
(as many as SSOO clones can be arrayed on a single slide or chip). Each chip is 

10 hybridized with a pair of cDNA probes that are fluorescence-labeled with Cy3 and Cy5 
respectively. Typically, l\ig of polyA* RNA is used to generate each cDNA probe. 
After hybridization, the chips are scanned and the fluorescence intensity recorded for 
both Cy3 and Cy5 channels. There are multiple built-in quality control steps. First, the 
probe quality is monitored using a panel of ubiquitously expressed genes. Secondly, the 

15 control plate also can include yeast DNA fragments of which complementary RNA may 
be spiked into the probe synthesis for measuring the quality of the probe and the 
sensitivity of the analysis. Currently, the technology offers a sensitivity of 1 in 100,000 
copies of mRNA. Finally, the reproducitility of this technology can be ensured by 
including duplicated control cDNA elements at different locations. 

20 The microarray results for clones 57885 (SEQ ID NO:197), 57886 (SEQ 

ID NO:198) and 57887 (SEQ ID NO:199) are as follows. 

Clone 57885: 16/38 (42%) of ovarian tumors showed an e}q>ression 
signal value of >0.4. The mean value for all ovaiy tumors was 0.662 with a mean value 
of 0.187 for all normal tissues, which yields a 3.64 fold overexpression level m ovary 

25 tumor relative to essential normal tissues. Normal tissue expression was elevated (>0.4) 
in peritoneum, skin and thymus. 

Clone 57886: 16/38 (42%) of ovarian tumors showed an expression 
signal value of >0.4. The mean value for all ovary tumors was 0.574 with a mean value 
of 0.166 for all normal tissues which yields a 3.46 fold overexpression level in ovary 

30 tumor relative to essential normal tissues. Normal tissue expression was elevated (>0.4) 
in heart, pancreas and small intestive. 
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Clone 57887: 17^8 (44%) of ovarian tumors showed an expression 
signal value of >0.4. The mean value for all ovaiy tumors is 0.744 with a mean value of 
0.184 for all normal tissues which yields a 4.04 fold overexpression level in ovary 
tumor relative to essential nonnal tissues. Normal tissue expression was elevated (>0.4) 
5 in esophagus. 

EXAMPLES 

Expression of recombinant antigen 0568S in e. coli 

This example describes tiie expression of recombinant antigen 0568S 

10 (SEQ ID NO: 177) in E. coli. This sequence was identified in Example 1 from the 
POTS 7 subtraction library using primary ovarian tumor cDNA as the tracer. PGR 
primers specific for fiie open reading firame of 0568S w^ designed and used in the 
specific amplification of 0568S. The PGR product was enzymadcally digested with 
EcoRI and ligated into pPDM, a modified pET28 vector which had been cut with the 

15 restriction enzymes EcoRI and Eco72L The construct sequence and orientation was 
confirmed through sequence analysis, the sequence of which is shown in SEQ ID 
NO:206. The vector was then transformed into the expression hosts, BLR (DE3) and 
HMS 174 (DE3) pLys S. Protein expression was confirmed, the sequence of which is 
provided in SEQ ID NO:207. 

20 

EXAMPLE4 
ADDmoNAL Sequence Obtained for Glone 0591 S 
The sequence of 0591 S (clone identifier 57887) was used to search 
public sequence databases. It was found that the reverse strand showed some degree of 
25 identity to the C-terminal end of GPR39. The cDNA for the coding region of GPR39 is 
disclosed in SEQ ID NO:208 and the corresponding amino acid sequence in SEQ ID 
NO:209. The GPR39 codmg region contains two exons. Both 0591 S and GPR39, 
encoded by the complementary strand of 0591 S, are located on chromosome 2. 
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EXAMPLES 

Further characterization of 059 1 S and identificaton of extended seqeunce 
O1034C is an ovaiy specific gene identified by electronic subtraction. 
Briefly, electronic subtraction involves an analysis of EST database sequences to 
5 identify ovarian-specific genes. In the electronic subtraction method used to indentify 
O1034C, sequences of EST clones d^ived firom ovary bbraries (normal and tumor) 
wore otained firom the GenBank public human EST database. Each ovary sequence was 
used as a "seed" query in a BLASTN search of the total human EST database to identify 
other EST clones that share sequence with the seed sequence (clones that potentially 

10 originated fix)m the same mKNA). EST clones with shared sequence were grouped into 
clusters, and clusters that shared sequence with dther clusters were grouped into 
superclusters. The tissue source of each EST within each supercluster was noted, and 
superclusters were ranked based on the distribution of the tissues fix>m which the ESTs 
originated Superclusters that comprise primarily, or solely, EST clones from ovary 

15 libraries were considered to represent genes that were differentially expressed in ovary . 
tissue, relative to all other normal adult tissue. 

This clone was identified from the public EST databases as Integrated 
Molecular Analysis of Genomics and their Expression (IMAGE) clone number 595449 
(the IMAGE consortium is a repository of EST clones and cDNA clones) and is 

20 disclosed as SEQ ID NO:210. Accession numbers AA173739 and AA173383 
represents the sequence of the identified EST in Genebank. This clone is part of 
Unigene cluster HS.85339 (Unigene is an e3q)erimental system for automatically 
partitioning Grabank sequences into a non-redundant set of gene-orientated clusters) 
and was annotated as encoding a neurotensin-like G protein coupled receptor (GRP39). 

25 However, the inventors have discovered that IMAGE#595449 encodes a novel protein 
derived from the complementary strand to that which encodes the potential GPR39. 

Microarray analysis of the clone using a series of ovary tumor specific 
probes indicated that this clone was over expressed 4.95-fold in a group of ovary tumor 
and normal ovary samples as compared to a group of essential normal tissue samples. 

30 IMAGE#S9449 was subjected to a Blast A search of the EST database 

and Genbank and an electronic fiill length clone contig (O1034C) was generated by 
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extending IMAGE#595449 and its resulting contigs to completeion. This process was 
repeated to completion when no further EST sequences were identified to extend the 
consensus sequence. This electronically derived clone was identified as coding a 
previously described clone, 0591S, the sequence of which is disclosed in SEQ E) 
5 N0:211. The discovery of this ovary specific candidate is described in more detail in 
Example 4. 

The consensus sequence for O1034C extended finlher 5' than 0591 S due 
to the additional sequences derived firom two EST clones, accession mmibers BF345141 
and BE336607, the sequences for which are disclosed in SEQ ID NO:212 and 213 

10 respectively. Although BF345141 diverges firom the O1034C/O591S consensus at its 
3'-end (possibly representing a different splice form), and fix)m BE336607 at several 
bases at its 5 '-end, the two ESTs were compared to the available matching chromosome 
sequence. They were found on human chromosome 2, clone RPl l-159N20:htgs 
database accession number ACO 10974. These sequences were used to extend 

15 O1034C/O591S to form a final consensus sequence for O1034C/O591S of 1897 base 
pairs, disclosed in SEQ ID NO:214. 

An open reading frame (ORF) was identified within the O1034C/O591S 
consensus sequence (nucleotides 260-682), the predicted translation of which is 
disclosed in SEQ ID NO:215. A BLASTx database search against the Genbank 

20 database indicated that this ORF had no identity (E value <le-25) with any known 
human protein. The only match was with the G protein-coijpled receptors, including 
GPR39, which the inventors have shown to be encoded at the 3'-end of O1034C/O591S 
on the complementary strand. However, the ORF did encode a protein that had 93% 
similarity (131/141 amino acids) and 91% identity (129/141 amino acids) with an un- 

25 named murine product (Accession #BAA95101), suggesting that this is a real 
translation product that represents a novel human ovary-specific antigen. 

The novelty of O1034C/O591S was confirmed by Northern Blot analysis 
using single stranded probes that complement either GRP39 or O1034C/O591S, The 
strand-specific O1034C/O591S probe specifically hybridized to the ovary tumor 

30 samples probed on the Norfhen blot, whilst all samples were negative ^en probed with 
GPR39. In addition real-time PGR was performed usmg primers specific for either 
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GPR39 or O1034C/O591S. These results further demonstrated the differential 
expression profiles of the two sequences. This protein is a putative membrane protein 
as determined bom Corixa's Tmpred protein prediction algorithm. 

5 EXAMPLE6 

Expression Analysis and Further Characterization of 

Ovarian Sequence 0568S 
The ovarian sequence 0568S was originally identified as cDNA clone 
24742 (SEQ ID NO: 11 8). Using clone 24742 as a query sequence to search public 
10 sequence databases, the sequence was found to have a high degree of homology with 
KIAA0762 (SEQ ID NO:177) and with VSGF. The DNA sequence for VSGF is 
provided in SEQ ID 184 and the VSGF protein sequence is provided in SEQ ID 
NO:186. 

Real-time PGR (see Gibson et al.. Genome Research 5:995-1001, 1996; 

15 Held et al., Genome Research 5:986-994, 1996) is a technique that evaluates the level of 
PGR product accumulation during amplification. This technique permits quantitative 
evaluation of mRNA . levels in multiple samples. Briefly, mRNA is extracted from 
tumor and normal tissue and cDNA is prepared using standard techniques. Real-time 
PGR is performed, for example, using a Perkin Elmei/Applied Biosystems (Foster City, 

20 CA) 7700 Prism instrument. Matching primers and fluorescent probes are designed for 
genes of mterest usmg, for example, the primer express program provided by Perkin 
Ehner/Applied Biosystems (Foster City, CA). Optimal concentrations of primers and 
probes are initially determined by those of ordinary skill in the art, and control (e.g., P- 
actin) primers and probes are obtained commercially fix)m, for example, Perkin 

25 Ehner/Applied Biosystems (Foster City, CA). To quantitate the amount of specific 
RNA in a sample, a standard curve is generated using a plasmid containing the gene of 
interest Standard curves are generated using the Gt values determined in the real-time 
PGR, which are related to the initial cDNA concentration used in the assay. Standard 
dilutions rangmg fiom 10-10^ copies of the gsnt of interest are generally sufllcient. In 

30 addition, a standard curve is generated for flie control sequence. This permits 
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Standardization of initial RNA content of a tissue sample to tibie amount of control for 
comparison purposes. 

By RealTime PGR analysis, 0568 was highly overexpressed in the 
majority of ovary tumors and ovary tumor metastases tested relative to normal ovary 
5 tissue and relative to an extensive normal tissue paneL Little or no expression was 
observed ui normal esophagus, spinal cord, bladder, colon, liver, PBMC (activated or 
resting), lung, skin, small intestine, stomach, skeletal muscle, pancreas, dendritic cells, 
heart, spleen bone marrow, thyroid, trachea, thymus, bronchia, cerebellum, ureter, 
uterus and p^toneum epitheliimi. Some low level expression was observed in normal 

10 breast, brain, bone, kidney, adrenal gjand and salivary gland, but the expression levels 
in these normal tissues were generally at least several fold less than the levels observed 
in ovary tumors overexpressing 0568S. 

Moreovo*, a series of Northern blots was performed which also 
demonstrated that the ORF region of 0568S is specifically overexpressed in ovary 

15 tumors. The initial blot contained RNA fi'om a series of normal tissues as well as fi'om 
ovary tumors. This blot was probed using, as a labeled probe, DNA from 0568S that 
corresponded to the 3'IJTR of the VSGF sequence disclosed in SEQ ID NO: 184. This 
blot revealed an ovary tumor-specific 5.0Kb message as well as a potential 3.5Kb brain 
specific message and a ubiquitously expressed 1 .35 Kb message. 

20 Another Northrai blot was performed with RNAs fit>m a number of 

different brain tissues and probed with the 3VTR region as above. Five of eleven brain 
samples showed overexpression of the 3.5Kb message. In order to determine whetha: 
the ORF region of 0568S was specifically overe3q)ressed in ovary tumors, a series of 
three blots was carried out using three separate probes designed fi-om within the VSGF 

25 ORF of 0568S. Results firom these experiments clearly indicated that only the 5.0Kb 
message is expressed in ovary tumor. 

EXAMPLE? 
Synthesis of Polypeptides 
30 Polypeptides are synthesized on a Perkin Elmer/Applied Biosystems 

Division 430A peptide synfliesizer using EMOC chenxistty with HPTU (O- 
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Benzotriazole-NjNJsTJsf -tetramethyliiroiiium hexafluorophosphate) activation. A Gly- 
Cys-Gly sequence is attached to the amino terminus of the peptide to provide a method 
of conjugation, binding to an immobilized surfece, or labeling of the peptide. Cleavage 
of the peptides from the solid support is carried out using the following cleavage 
5 mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After 
cleaving for 2 hours, the peptides are precipitated in cold methyl-t-butyl-ether. The 
peptide pellets are then dissolved in water containing 0.1% trifluoroacetic acid (TFA) 
and lyophilized prior to purification by CI 8 reverse phase HPLC. A gradient of 0%- 
60% acetonitrile (containmg 0.1% TFA) in water (containing 0.1% TFA) is used to 
10 elute the peptides. Following lyophilization of the pure fractions, the peptides are 
characterized using electrospmy or other types of mass spectrometry and by amino acid 
analysis. 

EXAMPLE 8 

15 0568S Northern Blot Analysis 

As described in Example 6, Northern blot analysis demonstrated that the 
ORF region of 0568S was specifically over expressed in ovarian tumors. The original 
probe used corresponded to the 3'UTR of the VSGF sequence disclosed in SEQ ID 
N0:1 84. The results from these Northern blots revealed an ovarian tumor-specific 5.0 

20 Kb message as well as a potential 3.S Kb brain specific mess^e. To confirm that the 
entire region covered by the ORF yields a single 5.0 Kb ovarian tumor-specific 
message, two additional probes were designed. The probes were located at the 5' and 3' 
regions of the ORF. Northern blot analysis using these two probes demonstrated that 
both probes hybridized to a 5.0 Kb product present only in ovarian tumor samples. 

25 Both probes failed to hybridize with RNA derived from multiple brain samples. 

EXAMPLE 9 

Real Time PGR And Northern Blot Analysis Of O590S 
Real time PGR analysis of ovarian tumor antigen O590S was performed 
30 essentially as described in Example 6. O590S specific primers and probe were designed 
and quantitative Real Time PGR was performed on a panel of cDNAs prepared &om a 



106 



wo 02/39885 



PCTAJSO 1/45395 



variety of tissues including ovarian tumor samples and a panel of normal tissues. This 
analysis revealed that O590S-specific mRNA was over expressed in approximately 65% 
of ovarian tumor samples tested, 100% tumor samples derived from SCED mice, and 
100% ovarian tumor cell lines tested, when compared to normal ovarian tissue. No 
5 detectable expression was observed m normal tissues. 

In addition to Real Time PCR, Northern blot analysis was performed to 
determine to transcript size of O590S. The Northern blot was probed with a 537 bp 
PGR product specific for O590S, which was designed to avoid regions of repeat 
sequences. This probe revealed a smeared band that was q)proximately 9.0 Kb in size, 
1 0 which was present in the majority of ovarian tumor samples tested. 

From tiie foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

What is Claimed: 

1 . An isolated polypeptide comprising an amino acid sequence of an . 
ovarian tumor protein selected fiom the group consisting of: 

(a) sequences provided m SEQ ID NOs:215, 200-202, 207, and 209; 

(b) sequences having at least 70% identity to a sequence provided in 
SEQ ID NOs: 215, 200-202, 207, and 209; and 

(c) sequences having at least 90% identity to a sequence provided m 
SEQ ID NOs: 215, 200-202, 207, and 209. 

2. An isolated polynucleotide comprising a sequence selected from 
the group consisting of: 

(a) sequences provided in SEQ ID NO: 214, 203-206, 208, and 210- 

213; 

(b) complements of the sequences provided m SEQ ID NO: 214, 
203-206, 208, and 210-213; 

(c) sequences consisting of at least 20 contiguous residues of a 
sequence provided in SEQ ID NO: 214, 203-206, 208, and 210-213; 

(d) sequences that hybridize to a sequence provided in SEQ ID NO: 
214, 203-206, 208, and 210-213 under moderately stringent conditions; 

(e) sequences having at least 75% identity to a sequence provided in 
SEQ ID NO: 214, 203-206, 208, and 210-213; 

(f) sequences having at least 90% identity to a sequence provided in 
SEQ ID NO: 214, 203-206, 208, and 210-213 and 

(g) degenerate variants of a sequence provided in SEQ ID NO: 214, 
203-206, 208, and 210-213. 
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3 . An isolated polypeptide comprising an amino acid sequence of an ■ 
ovarian tumor protein selected fix>m the group consisting of: 

(a) sequences encoded by a polynucleotide of claim 2; 

(b) sequences having at least 70% identify to a sequence encoded by 
a polynucleotide of claim 2; and 

(c) sequences having at least 90% identity to a 
sequence encoded by a polynucleotide of claim 2. 



4. An expression vector comprising a polynucleotide of claim 2 
operably linked to an expression control sequence. 

5. A host cell transformed or transfected with an e?q)ression vector 
according to claim 4. 

6. An isolated antibody, or antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 1 or claim 3. 

7. A method for detecting the presence of an ovarian cancer in a 
padent, comprising the steps of: 

(a) obtaining a biological sample from tibie patient; 

(b) contacting the biological sample with a binding agent that binds 
to a polypeptide of claim 1 or claim 3; 

(c) detectmg in the sample an amount of polypeptide that binds to 
the binding agent; and 

(d) comparing the amount of polypeptide to a predetermined cut-off 
value and therefrom determining the presence of a cancer in the patient 

8. A fusion protein comprising at least one polypeptide according to 
claim 1 or claim 3. 
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9. An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NO: 214, 203-206, 208, and 210-213 under moderately stringent conditions. 

10. A niethod for stimulating and/or expanding T cells specific for a 
tumor protein, comprising contacting T cells with at least one component selected from 
the group consisting of: 

(a) polypeptides according to claim 1 or claim 3; 

(b) polynucleotides according to claim 2; and 

(c) antigen-presenting cells that express a polypeptide according to 
claim 1 or claim 3, 

under conctitions and for a time sufGcient to permit the stimulation 
and/or expansion of T cells. 

11. An isolated T cell population, comprising T cells prepared 
according to the method of claim 10. 

12. A composition comprising a first component selected from the 
group consisting of physiologically acceptable carriers and immimostimulants, and a 
second component selected firom the group consisting of: 

(a) polypeptides according to claim 1 or claim 3; 

(b) polynucleotides according to claim 2; 

(c) antibodies according to claim 6; 

(d) fusion proteins according to claim 8; and 

(e) antigen presenting cells that express a polypeptide according to 
claim 2 or claim 3. 

13. A method for stimulating an immune response in a patient, 
comprising administering to the patient a composition of claim 12. 

14. A metiiod for the treatment of a cancer in a patient, comprising 
administering to the patient a composition of claim 12. 
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15. A method for detennining the presence of a cancer in a padent, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 
according to claim 9; 

(c) detecting in the sample an amomit of a polynucleotide that 
hybridizes to fhe oligonucleotide; and 

(d) compare the amount of polynucleotide that hybridizes to flie 
oUgonucleotide to a predetermined cut-ofiF value, and therefrom detennining the 
presence of the cancer in the patient 

16. A diagnostic kit comprising at least one oligonucleotide 
according to claim 9. 

17. A diagnostic kit comprising at least one antibody according to 
claim 6 and a detection reagent, wherein the detection reagent comprises a reporter 
groiq). 

18. A method for inhibiting the development of a cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient 
with at least one component selected from the group consisting of: (i) polypeptides 
according to claim 1 or claim 3; (ii) polynucleotides according to claim 2; and (iii) 
antigen presenting cells that express a polypeptide of claim 1 or claim 3, such that T cell 
proliferate; 

(b) administering to the patient an effective amount of the 
proliferated T cells, 

and thereby inhibiting the development of a cancer in the patient. 
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SEQUENCE LISTING 

<110> Corixa Cojrporation 
Xu, Jiangchxan 
Stolk^ John A. 
Algate/ Paul A. 
Fling, Steven P. 
Molesh, David Alan 



<120>- COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF OVARIAN CANCER 



<130> 210121. 48401PC 

<140> PCT 

<141> 2001-11-13 

<160> 215 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 303, 370, 377, 382 
<223> n - A,T,C or G 

<400> 1 

caacctcact agtaaatgaa agaaatattg taatttgtat ttgatctgct gggtctttgg 60 
agtcagaact ggttttatca gcagtttgat cttctgaggt ctggtatgta gtttgctggc 120 
ccacagaacc ttcacgtgta ttcacagcct caatgccata aggaaactct tttagaagtt 180 
ctgacagctg gtcatgtagg tataagacag gtgccttatc actgtggatt tcatttcttg 240 
caggatcttg gggagtatag ttgctggatg catctatttc ctgagggtaa atatcctcct 300 
ggncgacgcg gccgctcgag tctagagggc ccgtttaaac ccgctgatca gcctcgactg 360 
tgccttctan ttgccancca tntgttgttt gcccct 396 

<210> 2 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<400> 2 

cgaccaaaaa gtaaactcca agtgaacatc aaatcaaatc taatcctttt ggccacatga 60 

ctggttgttc tttatctcat agttacaatg aatcatataa actgtagact gccactacca 120 

cgatacttct gtgacacaga aggaatgtcc tatttgccta tctatctgag gaatgttaaa 180 

tagagaaaaa tagattataa aacaacctgg aggtcacagg attctgagat aatccctctg 240 

ttaaaaaaca tctgaacagc aaatgtccaa tctgtaataa aatagttaaa ggtccaagtc 300 

aagtccactt ctacttggct ggcccagcac aagaaatcta acagcacttt gtaatcattt 360 
tgcttttcta attttcccgg aggacatggg ccattg 396 



<210> 3 
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<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<222> 22, 28, 29, 30, 33, 36, 41, 43, 45, 46, 53, 56, 58, 61, 64, 
69, 70, 74, 75, 78, 83, 84, 85, 102, 143, 335 
<223> n « A,T,C or G 



<400> 3 

cgcccttttt tttttttttt 

haancccann cccnnggnac 
gggtgagggg gcatctactc 
tgacctgcct tggagcctta 
ctgaggctat ggaagtctcc 
cccctagcac tccacgtttt 
cccagctgag gttgtgagct 

<210> 4 
<211> 396 
<212> DNA 

<213> Homo sapiens 



tnattggnnn aantcncttt 
cannnccagg agttgggtgg 
ctnttgcaac aagccaaaag 
gtccctccct tagggccccc 
ctcctagttc actagcaggt 
tctgaaaaaa tctanacagg 
tgtaaggtaa agcaag 



nantnnaaaa acntgnangg 60 
anactgagtg gggtttgtgt 120 
tagaacagcc taaggaaaag 180 
tcagcctacc ctatccaagt 240 
tccccatctt ttccaggctg 300 
ccctttttgg gtacctaaaa 360 

396 



<220> 

<221> inisc_feature 

<222> 13, 15, 21, 27, 34, 37, 41, 57, 58, 59, 63, 64, 71, 72, 77, 

78, 83, 87, 93, 170, 207, 210, 308, 379, 382, 389, 391, 

392, 393, 395 

<223> n = A,T,C or G 



<400> 4 

gaccaatcct tgncncacta ncaaaangac cccnctnacc nccaggaact gaacctnnnt 60 
gtnnacctcc nnctgcnnag ccntatntcc aanatcaccc accgtatcca ctgggaatct 120 
gccagcctcc tgcgatcaga agagaccaat cgaaaatgag ggtttcacan tcacagctga 180 
aggaaaaggc caaggcacct tgtcggnggn gacaatgtac catgctaagg ccaaagatca 24 0 
actcacctgt aataaattcg acctcaaggt caccataaaa ccagcaccgg aacagaaaaa 300 
gaggcctnag gatgcccaag aaacactttt gatcctttga aaactgtacc aaggtaccgg 360 
ggggagaccc aggaaaggnc cnttatgtnt nnntnt 396 

<210> 5 
<211> 396 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 135, 172, 343, 348, 354, 395 

<223> n = A,T,C or G 



<400> 5 

gacgccggag ctgccgcgcc agtcgcctag 

gatcttcatc ggcacagggg ccactgagac 

tttctcttcc ctctngttta gtttgcctgg 

cccaaaccct ttctgcttct gcccatcaca 

tcctccctct tctcccgact atttggcaag 

gatgctgctg gcaagacaac cattcttgat 

atttctacca ttgggtttaa tgggggaaac 



caggtcctct accggcttat tcctgtgccg 60 
gtttctgcct ccctctttct tcctccgctc 120 
gagcttgaaa ggagaaagca cnggggtcgc 180 
agtgccacta ccgccatggg cctcactatc 240 
aagcagatgc gcattttgat ggttggattg 300 
aaactgaaag tanggganat aagnaccacc 360 
agtana 396 
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<210> 6 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 
<222> 212 

<223> n « A,T,C or G 
<400> 6 

acgggaggcg ccgggaagtc gacggcgccg 
ctcccgtgaa gatgtccact ccagacccac 
ccccgggccc tgcccttccc ctggagccat 
ctccgcccac agcatgatgg ggcccagccc 
ccacccaggg gcctggaggg taccctcagg 
agtccatgca tgagaagggc atgtcggacg 
ggatgcggtc agggggccat gctgggatgg 

<210> 7 

<211> 396 

<212> DNA 

<213> Homo sapiens 



gcggctcctg caggaggcca ctgtctgcag 60 
ccctgggcgg aactcctcgg ccaggtcctt 120 
gctgggccct agcccgggtc cctcgccggg 180 
angggccgcc ctcagcagga caccccatcc 24 0 
acaacatgca ccagatgcac aagcccatgg 300 
acccgcgcta caaccagatg aaaggaatgg 360 
ggcccc 396 



<40D> 7 

acccgagagt cgtcggggtt tcctgcttca 
ccccaccccg gccggccgcc catagccagc 
ctgccccaag gcccccgccg ccgctccagc 
ctccttagtc gccgccatga cgaccgcgtc 
ggactcagag gccgccatca accgccagat 
cctgtccatg tcttactact ttgaccgcga 
ctttcttcac caatctcatg aggagaggga 

<210> 8 

<211> 396 

<212> DNA 

<213> Homo sapiens 



acagtgcttg gacggaaccc ggcgctcgtt 60 
cctccgtcac ctcttcaccg caccctcgga 120 
gccgcgcagc caccgccgcc gccgccgcct 180 
cacctcgcag gtgcgccaga actaccacca 240 
caacctggag ctctacgcct cctacgttta 300 
tgatgtggct ttgaagaact ttgccaaata 360 
acatgc 396 



<400> 8 

cgacaacaag gttaatacct tagttcttaa 
tgctaccttg gtaggaaact tatttacaaa 
aatataaagt gctctgaata aagcagaaat 
aaaccaccca aatgaccaag gcatatatag 
tagggaggag aatgaaggaa aatgcaacca 
agtagaaggc acaggagagg tagcaaaggc 
ggtgaaaaaa acactgccat tcacaagtca 

<210> 9 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 321, 344 
<223> n « A,T,C or G 



catttttttt ctttatgtgt agtgttttca 60 
ccatattaaa aggctaattt aaatataaat 120 
atattacagt tcattccaca gaaagcatcc 180 
tatttggagg aatcaggggt ttggaaggag 24 0 
gcatgattat agtgtgttca tttagataaa 300 
caggcttttc tttggttttc ttcaaacata 360 
aggaac 396 
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<400> 9 

tcgacatcgc ggcaactttt tgcggattgt tcttgcttcc aggctttgcg ctgcaaatcc 60 
agtgctacca gtgtgaagaa ttccagctga acaacgactg ctcctccccc gagttcattg 120 
tgaattgcac ggtgaacgtt caagacatgt gtcagaaaga agtgatggag caaagtgccg 180 
ggatcatgta ccgcaagtcc tgtgcatcat cagcggcctg tctcatcgcc tctgccgggt 240 
accagtcctt ctgctcccca gggaaactga actcagtttg catcagctgc tgcaacaccc 300 
ctctttgtaa cgggccaagg nccaaaaaaa ggggaaagtt ctgncctcgg ccctcaggcc 360 
agggctccgc accaccatcc tgttcctcaa attagc 396 

<210> 10 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 115, 116, 117, 130, 138, 142, 143, 144, 145, 146, 153, 157, 
158, 159, 160, 164, 175, 176, 177, 178, 179, 183, 187, 197, 
198, 202, 203, 204, 205, 206, 211, 212, 213, 215, 216, 217, 
220, 221, 222, 226, 231, 234, 236, 237, 245, 246, 247 
<223> n = A,T,C or G 

<221> inisc_f eature 

<222> 250, 255, 264, 266, 267, 268, 269, 270, 271, 272, 279, 284, 
297, 303, 304, 305-, 308, 315, 317, 318, 319, 320, 321, 322, 
323, 333, 334, 337, 338, 342, 343, 368, 372, 374, 380, 381, 
391, 395 

<223> n = A,T,C or G 

<400> 10 

cctttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 
tttttttttt tttttttttt tttttttttt tttttttttt ttttaaaaaa aaaannnttt 120 
tttttttttn aaaaaaangg gnnnnntttt ttncccnnnn gggngggggg ggggnnnnnt 180 
ttnaaanaaa aaaaccnnaa annnnngggg nnnannnaan nncccncccc naancnntaa 240 
aaaannnggn aaaanagggg gggnannnnn nnggggggna aaantttttt ttttttnaag 300 
ggnnnggnaa aaaantnnnn nnnttttttt ttnnaanngg gnnaaaaaaa aaaaaaaaaa 360 
attttttngg gntnaggggn ngggggaaaa ncccna 396 



<210> 11 
<211> 396 
<212> DMA 

<213> Homo sapiens 
<400> 11 

agaacacagg tgtcgtgaaa actaccccta 
atcaacattg tcgtcattgg acacgtagat 
atctataaat gcggtggcat cgacaaaaga 
gagatgggaa agggctcctt caagtatgcc 
gaacgtggta tcaccattga tatctccttg 
actatcattg atgccccagg acacagagac 
caggctgact gtgctgtcct gattgttgct 

<210> 12 
<211> 396 
<212> DNA 

<213> Homo sapiens 



aaagccaaaa tgggaaagga aaagactcat 60 
tcgggcaagt ccaccactac tggccatctg 120 
accattgaaa aatttgagaa ggaggctgct 180 
tgggtcttgg ataaactgaa agctgagcgt 240 
tggaaatttg agaccagcaa gtactatgtg 300 
tttatcaaaa acatgattac agggacatct 360 
gctggt 396 



<400> 12 
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cgaaaacctt taaaccccgg tcatccggac 

cttctgtggt gtcatttctg aaacaagggc 

gtcttcaagt gacctgtact gcttggggac 

gtttaaaaaa tatgtatcta agaatgttct 

atttcgggcc ctcctcttca ggaatcttcc 

tggcttttgc tgcggccccg tggggtagga 

acattcagag gcatcacaag taatggcaca 

<210> 13 

<211> 396 

<212> DNA 

<213> Homo sapiens 



atcccaacgc atgctcctgg agctcacagc 60 
gtggatccct caaccaagaa gaatgtttat 120 
tattggagaa aataaggtgg agtcctactt 180 
agggcactct gggaacctat aaaggcaggt 240 
tgaagacatg gcccagtcga aggcccagga 300 
gggacagaga gacagggaga gtcagcctcc 360 
attctt 396 



<400> 13 

accacaggct ggccacaaga agcgctggag 

ggtccggctg ctgcacgtgc gtgccggctt 

ggccatcgcc accctgtgct tcagccccgc 

tgacaagcgg atcatcctct gggacatcgg 

cagccagctg ctcacactgg acaccacctc 

ctgcccggac gcccgcctgc tggccggctg 

gctggaccag ccccaaaaga ggagggtgtg 

<210> 14 

<211> 396 

<212> DNA 

<213> Homo sapiens 



tgtgctggcg gctgcaggcc tacggggcct 60 

ctgctgcggg gtcatccgag cccacaagaa 120 

ccacgagacc catctcttca cggcctccta 180 

ggtgcccaac caggactacg aattccaggc 240 

tatccccctg cgcctctgcc ctgtcgcctc 300 

cgagggcggc tgctgctgct gggacgtgcg 360 
tgaagt 396 



<400> 14 

acggcgtcct cgtggaagtg acatcgtctt 
gccgtgatgc ccagggaaga cagggcgacc 
caactattgg atgattatcc gaaatgtttc 
cagatgcagc agatccgcat gtcccttcgc 
accatgatgc gcaaggccat ccgagggcac 
ctgcctcata tccgggggaa tgtgggcttt 
agggacatgt tgctggccaa taaggtgcca 

<210> 15 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 333 

<223> n = A,T,C or G 



taaaccctgc gtggcaatcc ctgacgcacc 60 

tggaagtcca actacttcct taagatcatc 120 

attgtgggag cagacaatgt gggctccaag 180 

gggaaggctg tggtgctgat gggcaagaac 240 

ctggaaaaca acccagctct ggagaaactg 300 

gtgttcacca aggaggacct cactgagatc 360 
gctgct 396 



<400> 15 

accgcgcggg cacagggtgc cgctgaccga ggcgtgcaaa gactccagaa ttggaggcat 60 

gatgaagact ctgctgctgt ttgtggggct gctgctgacc tgggagagtg ggcaggtcct 120 

gggggaccag acggtctcag acaatgagct ccaggaaatg tccaatcagg gaagtaagta 180 

cgtcaataag gaaattcaaa atgcttgtca acggggtgaa acagataaag actctcatag 240 

aaaaaacaaa cgaagagcgc aagacactgc tcagcaacct agaagaagcc aagaagaaga 300 

aagaggatgc cctaaatgag accagggaat canagacaaa gctgaaggag ctcccaggag 360 

tgtgcaatga gaccatgatg gccctctggg aagagt 396 

<210> 16 
<211> 396 
<212> DNA 
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<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 114, 121, 122, 123, 127, 134, 136, 138, 140, 141, 142, 143, 
144, 148, 163, 166, 172, 173, 174, 176, 177, 183, 184, 185, 
187, 195, 196, 198, 199, 202, 203, 206, 213, 214, 215, 216, 
217, 218, 219, 223, 225, 226, 227, 229, 230, 236, 238 
<223> n ^ A,T,C or G 



<221> misc_f eature 

<222> 239, 252, 256, 257, 261, 262, 268, 269, 273, 278, 280, 288, 
289, 290, 292, 293, 303, 312, 325, 327, 333, 335, 336, 341, 
342, 347, 354, 359, 365, 371, 383, 384, 386, 388, 391 
<223> n = A,T,C or G 



<400> 16 

tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 

tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttngggggg 120 

nnnaaanttt tttntnanan nnnngggnaa aaaaaaaaaa aanaangggg gnnntnnggc 180 

ccnnnanaaa aaaanngnna annaancccc ccnnnnnnnc ccncnnntnn ggaaananna 240 

aaaccccccc cngggnnggg nnaaaaannc ccnggggnan tttttatnnn annccccccc 300 

ccnggggggg gnggaaaaaa aaaantnccc ccnannaaaa nnggggnccc cccnttttnc 360 

aaaanggggg nccgggcccc ccnnantntt nggggg 396 

<210> 17 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<400> 17 

accacactaa ccatatacca atgatggcgc gatgtaacac gagaaagcac ataccaaggc 60 
caccacacac cacctgtcca aaaaggcctt cgatacggga taatcctatt tattacctca 120 
gaagtttttt tcttcgcagg atttttctga gccttttacc actccagcct agcccctacc 180 
ccccaactag gagggcactg gcccccaaca ggcatcaccc cgctaaatcc cctagaagtc 240 
ccactcctaa acacatccgt attactcgca tcaggagtat caatcacctg agctcaccat 300 
agtctaatag aaaacaaccg aaaccaaata attcaagcac tgcttattac aattttactg 360 
ggtctctatt ttaccctcct acaagcctca gagtac 396 



<210> 18 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 51, 54, 66, 81, 86, 98, 106, 111, 117, 124, 129, 133, 135, 
150, 151, 154, 159, 161, 172, 179, 181, 183, 185, 220, 223, 
229, 238, 258, 259, -264, 282, 289, 292, 294, 299, 303, 311, 
315, 329, 343, 349, 351, 353, 361, 369, 370, 389, 392 
<223> n = A, T,C or G 



<221> misc_feature 
<222> 396 

<223> n = A, T,C or G 

<400> 18 

tttttttttt tttttttttt tttttttttt tttttttttt ttttttttta ntcnaaaggg 60 
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gaaggnccct ttttattaaa nttggncatt ttactttnct tttttnaaaa ngctaanaaa 120 

aaanttttnt ttntncttaa aaaaaccctn natntcacna ncaaaaaaaa cnattcccnc 180 

ntncnttttg tgataaaaaa aaaggcaatg gaattcaacn tancctaana aaactttncc 24 0 

tgggaggaaa aaaaattnnt ccgngggaaa cacttggggc tntccaaant gnanccatnc 300 

tangaggacc ntctntaaga tttccaaang aaaccccttc ctnccaaang nantaccccg 360 

ntgcctacnn cccataaaaa aaacctcanc cntaan 396 

<210> 19 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 47, 69, 75, 80, 83, 87, 88, 90, 92, 102, 104, 108, 116, 121, 
130, 138, 139, 142, 153, 156, 158, 162, 165, 166, 180, 192, 
193, 195, 201, 224, 226, 232, 235, 237, 241, 248, 251, 253, 
256, 269, 272, 274, 277, 284, 287, 290, 292, 297 
<223> n = A,T,C or G 

<221> misc^feature 

<222> 299, 305, 306, 315, 323, 324, 326, 332, 351, 368, 377, 380, 

383, 387, 392 

<223> n = A,T,C or G 

<400> 19 

tttttttttt tttttttttt tttttttttt tttttttttt ttttttntgg tctgggcttt 60 

tattttacna aaaanctaan ggnaaanntn cnttaaacta antngaanac aaagtnttaa 120 

ngaaaaaggn ctgggggnnt cntttacaaa aanggncngg gncanntttg ggcttaaaan 180 

ttcaaaaagg gnncntcaaa ngggtttgca tttgcatgtt tcancnctaa ancgnangaa 240 

naaacccngg ngnccnctgg gaaaagttnt tnanctncca aaanatnaan tntttgnanc 300 

agggnntttt tgggnaaaaa aannanttcc anaaactttc catcccctgg ntttgggttc 360 
ggccttgngt tttcggnatn atntccntta angggg 396 

<210> 20 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 29, 43, 49, 53, 55, 75, 81, 100, 110, 111, 125, 129, 160, 
162, 168, 246, 277 
<223> n = A,T,C or G 

<400> 20 

tttttttttt tttttttttt ttttttctna acaaaccctg ttnttgggng ggngngggta 60 
taatactaag ttganatgat ntcatttacg ggggaaggcn ctttgtgaan naggccttat 120 
ttctnttgnc ctttcgtaca gggaggaatt tgaagtaaan anaaaccnac ctggattact 180 
ccggtctgaa ctcaaatcac gtaggacttt aatcgttgaa caaacaaacc tttaatagcg 240 
gctgcnccat tgggatgtcc tgatccaaca tcgaggncgt aaaccctatt gttgatatgg 300 
actctaaaaa taggattgcg ctgttatccc tagggtaact tgttcccgtg gtcaaagtta 360 
ttggatcaat tgagtataag tagttcgctt tgactg 396 

<210> 21 

<211> 396 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<222> 6, 9, 18, 23, 31, 43, 48, 55, 65, 73, 75, 103, 110, 117, 
123, 125, 134, 153, 182, 195, 202, 205, 213, 216, 223, 239, 
249, 276, 293, 294, 302, 307, 344, 356, 359, 369, 374, 381, 
392 

<223> n = A,T,C or G 
<400> 21 

acatanatnt tatactanca ttnaccatct cacttgnagg aanactanta tatcnctcac 60 
acctnatatc ctncntacta tgcctagaag gaataatact atngctgttn attatancta 120 
ctntnataac cctnaacacc cactccctct tanccaatat tgtgcctatt gccatactag 180 
tntttgccgc ctgcnaagca gnggngggcc tanccntact agnctcaatc tccaacacnt 240 
atggcctana ctacgtacat aacctaaacc tactcnaatg ctaaaactaa tcnncccaac 300 
anttatntta ctaccactga catgactttc caaaaaacac atantttgaa tcaacncanc 360 
cacccacanc ctanttatta ncatcatccc cntact 396 



<210> 22 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 17, 244 

<223> n =» A,T,C or G 



<400> 22 

tttttttttt ttttganaaa agccggcata 
gactcataaa tggtgctggg ggaagggtgc 
ggacagcaaa ggggtgagaa ggggctgagg 
agggagccaa gcatcaaaaa acaggagatg 
taanagaaca ttcaaggatt tgtcatgatg 
aacagcacct tcaattgaaa ctgtcaatta 
ttggaaagtt atgagattac aaaattcctg 

<210> 23 

<211> 396 

<212> DNA 

<213> Homo sapiens 



aagcactttt attgcaataa taaaacttga 60 

agcaacgatt tctcaccaaa tcactacaca 120 

gaggaaaagc caggaaactg agatcagcag 180 

ctgaagctgc gatgaccagc atcattttct 240 

gctgggcttt cactgggtgt taagtctaca 300 

aagttcttaa gatttaggaa gtggtggagc 360 

aaagtc 396 



<400> 23 

acaaaggcgg ttccaagcta aggaattcca 
gcaggcctgt gaggttttca tatcctgaag 
gaaaaaatgt cagacacaca caaaagtaga 
ctgcagcttc aacagttacc acatttgcca 
cctggaaagc ccacggcccc tttttccctt 
gcagccaacc aaaggggcag ctcagctcct 
tgaagagttg atgtctttga caacatacgg 

<210> 24 

<211> 396 

<212> DNA 

<213> Homo sapiens 



tcagtgcttt tttcgcagcc accaaattta 60 

agatgtattt taaagctttt tttttttaat 120 

atagtaccat ggagtcccca cgtacccagc 180 

accggagaga ctgccaaggc aggaaaaagc 240 

gggtcagagg ccttagagct ggctgccaaa 300 

tcgtggcacc agcagtgttc ctgatgcagt 360 

acactg 396 



<220> 

<221> misc feature 
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<222> 313, 337, 340, 350, 351, 352, 353, 354, 355, 356, 366, 376, 
377, 378, 382, 384, 385, 387, 389, 390, 392, 393, 396 
<223> n = A,T,C or G 

<400> 24 

cgactatcct ctcagattct tatctggcac taatttataa ctattatatt atcagagact 60 

atgtagcaat atatcagtgc acaggcgcat cccaggcctg tacagatgta tgtctacacg 120 

taagtataaa tgaatttgca taccaggttt tacacttgca tctctaatag agattaaaaa 180 

caacaaattg gcctcttcct aagtatatta atatcattta tccttacatt ttatgcctcc 240 

ccctaaatta atgactgagt tggtggaaag cggctaggtt ttattcatac tgttttttgt 300 

tctcaacttc aanagtaatc tacctctgaa aaatttntan tttaatattn nnnnnnagga 360 

atttgngcca ctttannnct tncnntntnn tnnccn 396 

<210> 25 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 90, 125, 136, 278, 299, 301, 305, 344, 347, 353, 355, 356, 
357, 359, 360, 361, 365, 369, 378, 380, 381, 382, 383, 384, 
385, 386, 391, 392, 393, 395, 396 
<223> n « A,T,C or G 

<400> 25 

tttttttttt tttttttttt gtcttttaaa aaatataaaa gtgttattat tttaaaacat 60 
caagcattac agactgtaaa atcaattaan aactttctgt atatgaggac aaaaatacat 120 
ttaanacata tacaanaaga tgctttttcc tgagtagaat gcaaactttt atattaagct 180 
tctttgaatt ttcaaaatgt aaaataccaa ggctttttca catcagacaa aaatcaggaa 24 0 
tgttcacctt cacatccaaa aagaaaaaaa aaaaaaancc aattttcaag ttgaagttna 300 
ncaanaatga tgtaaaatct gaaaaaagtg gccaaaattt taanttncaa canannngnn 360 
ncagntttna tggatctntn nnnnnncttc nnntnn 396 

<210> 26 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 313, 314, 316, 318, 321, 343, 344, 352, 353, 356, 363, 366, 
370, 372, 373, 374, 375, 377, 378, 379, 383, 384, 385, 386, 
387, 391, 393, 394, 395, 396 
<223> n = A,T,C or G 

<400> 26 

gacgctcccc cctccccccg agcgccgctc cggctgcacc gcgctcgctc cgagtttcag 60 

gctcgtgcta agctagcgcc gtcgtcgtct cccttcagtc gccatcatga ttatctaccg 120 

ggacctcatc agccacgatg agatgttctc cgacatctac aagatccggg agatcgcgga 180 

cgggttgtgc ctggaggtgg aggggaagat ggtcagtagg acagaaggta acattgatga 24 0 

ctcgctcatt ggtggaaatg cctccgctga aggccccgag ggcgaaggta cccgaaagca 300 

cagtaatcac tgnngncnat nttgtcatga accatcacct gcnngaaaca annttnacaa 360 

aanaancctn cnnnnannnc ctnnnnnatt ncnnnn 396 

<210> 27 
<211> 396 
<212> DNA 



wo 02/39885 



PCTAJSOl/45395 



10 



<213> HoiQO sapiens 



<220> 

<221> raisc__feature 

<222> 49, 61, 66, 73, 75, 99, 102, 103, 105, 107, 120, 124, 126, 
129, 138, 139, 141, 147, 155, 157, 162, 165, 175, 187, 191, 
193, 198, 207, 217, 218, 220, 221, 223, 226, 231, 232, 245, 
257, 259, 260, 263, 266, 271, 287, 305, 306, 307, 308 
<223> n = A, T,C or G 

<221> misc_feature 

<222> 321, 330, 332, 335, 342, 343, 344, 345, 349, 350, 351, 352, 
354, 355, 356, 357, 365, 366, 367, 370, 371, 372, 373, 374,. 
375, 376, 377, 378, 379, 380, 381, 382, 383, 386, 387, 388, 
389, 391, 392, 393, 394, 395, 396 
<223> n = A, T,C or G 

<400> 27 

tttttttttt tttttttttt tttttttttt tttttttttt tggctaaant ttatgtatac 60 
nggttnttca aangnggggg aggggggggg gcatccatnt anncncncca ggtttatggn 120 
gggntnttnt actattanna nttttcnctt caaancnaag gnttntcaaa tcatnaaaat 180 
tattaanatt ncngctgnta aaaaaangaa tgaaccnncn nanganagga nntttcatgg 240 
ggggnatgca tcggggnann ccnaanaacc ncggggccat tcccganagg cccaaaaaat 300 
gtttnnnnaa aaagggtaaa nttacccccn tnaantttat annnnaaann nnannnnagc 360 
ccaannnttn nnnnnnnnnn nnnccnnnna nnnnnn 396 



<210> 28 
<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> inisc_feature 

<222> 278, 283, 298, 309, 326, 331, 338, 351, 355, 356, 357, 358, 
360, 371, 377, 378, 383, 386, 387, 391, 393, 394, 395 
<223> n = A,T,C or G 

<400> 28 

cgaccttttt tttttttttt atagatgaaa gagggtttat ttattaatat atgatagcct 60 

tggctcaaaa aagacaaatg agggctcaaa aaggaattac agtaacttta aaaaatatat 120 

taaacatatc caagatccta aatatattat tctccccaaa agctagctgc ttccaaactt 180 

gatttgatat tttgcatgtt ttccctacgt tgcttggtaa atatatttgc ttctcctttc 240 

tgcaatcgac gtctgacagc tgatttttgc tgttttgnca acntgacgtt tcaccttntg 300 

tttcaccant tctggaggaa ttgttnaaca ncttacanca ctgccttgaa naaannnnan 360 

gcctcaaaag ntcttgnnct atnctnnttc ntnnnt 396 

<210> 29 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 329, 334, 361, 386, 390 

<223> n = A,T,C or G 

<400> 29 

gacttgctca tttagagttt gcaggaggct ccatactagg ttcagtctga aagaaatctc 60 
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ctaatggtgc tatagagagg gaggtaacag aaagactctt ttagggcatt tttctgactc 120 
atgaaaagag cacagaaaag gatgtttggc aatttgtctt ttaagtctta accttgctaa 180 
tgtgaatact gggaaagtga tttttttctc actcgttttt gttgctccat tgtaaagggc 240 
ggaggtcagt cttagtggcc ttgagagttg cttttggcat ttaaatattc taagagaatt 300 
aactgtattt cctgtcacct attcactant gcangaaata tacttgctcc aaataagtca 360 
ntatgagaag tcactgtcaa tgaaanttgn tttgtt 396 

<210> 30 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 28, 83, 126, 138, 254, 275, 298, 310, 311, 353, 363, 374, 
379, 393 

<223> n = A,T,C or G 

<400> 30 

tttttttttt tttttttttg aaatttanaa acaaatttta tttaagatct gaaatacaat 60 
tcctaaaata tcaacttttc canaaaaccg tggctacaca ataatgcatt gcctctatca 120 
tgttanaacg tgcattanac tcaaatacaa aaaccatgaa acaaatcacc atccttcaac 180 
aatttgagca aagatagaat gcctaagaac aacatagatg gacttgcaga ggatgggctg 240 
ttttacttca agcnccataa aaaaaaaaaa gagcncaaat gcattgggtt ttcaggtnta 300 
tacattaagn ngaacctttg gcactaggaa tcagggcgtt ttgtcacata gcnttaacac 360 
atnttaaaaa attntgtant gtcaaaggga tangaa 396 

<210> 31 

<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 285, 287, 350, 362, 365, 377, 378, 382, 388, 390, 393 
<223> n = A,T,C or G 



<400> 31 

gacgggccag ggccatctgg aaagggaact 

tcgggtgtgt ggtgaacacg ttcagttcat 

gctgtggctc tgccatgccg ggctgtgttt 

aaaaccagcc acttcttttc ataagcactg 

tcagtgcctc acgcaggcaa atgcactgaa 

gtgagttccc ccgacagccc acgacagcca 

angancacgg cacacanntc ancctctnan 



cggcttttcc agaacgtggt ggatcatctg 60 
cagggcctac gctccgggaa ggggccccca 120 
gcagctgtcc gagtctccat ccgcctttag 180 
acagggccca gcccacagcc acaggtgcga 240 
acccaggggc acacncncgc agagtgaaca 300 
ggactgccct ccccaccccn ccccgacccc 360 
ctngct 396 



<210> 32 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 341 

<223> n = A,T,C or G 
<400> 32 

cgactggcct cataccttgt ctacacagtc cctgcacagg gttcctaacc tgtggttagt 60 
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aaagaatgtc actttctaac aggtctggaa gctccgagtt tatcttggga actcaagagg 120 
agaggatcac ccagttcaca ggtatttgag gatacaaacc cattgctggg ctcggcttta 180 
aaagtcttat ctgaaattcc ttgtgaaaca gagtttcatc aaagccaatc caaaaggcct 240 
atgtaaaaat aaccattctt gctgcacttt atgcaaataa tcaggccaaa tataagacta 300 
cagtttattt acaatttgtt tttaccaaaa atgaggacta nagagaaaaa tggtgctcca 360 
aagcttatca tacatttgtc attaagtcct agtctc 396 

<210> 33 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 121, 122, 124, 125, 126, 128, 130, 131, 132, 133, 134, 136, 
137, 153, 154, 155, 156, 157, 158, 159, 168, 169, 170, 171, 
172, 173, 174, 175, 176, 177, 178, 179, 184, 185, 192, 197, 
199, 200, 202, 204, 205, 208, 209, 210, 211, 214, 215 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 216, 217, 218, 222, 227, 228, 229, 233, 234, 241, 242, 244, 
245, 246, 247, 248, 249, 252, 260, 261, 262, 263, 264, 265, 
270, 272, 273, 274, 275, 279, 282, 284, 288, 290, 291, 292, 
293, 294, 299, 300, 301, 302, 303, 306, 313, 314, 319 
<223> n A,T,C or G 

<221> misc_feature 

<222> 327, 328, 330, 331, 332, 333, 334, 335, 343, 349, 350, 351, 
352, 355, 360, 369, 370, 371, 375, 379, 387, 388, 390, 391, 
392, 393, 394, 395, 396 
<223> n = A,T,C or G 

<400> 33 

cctttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 

tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 120 

nngnnntntn nnnnannaaa aaaaaaaaaa aannnnnnna aaaaaaannn nnnnnnnnnt 180 

tttnnggggg gnttttnann gnannttnnn nttnnnnnaa anccccnnng ggnngggggg 240 

nntnnnnnng gnaaaaaaan nnnnnggggn cnnnngggnc cncncccnan nnnnaaaann 300 

nnnggntttt ttnnttttna aaaaaanngn nnnnnaacaa aantttttnn nnaanttttn 360 

gggggaaann ncccntttnt ttttttnnan nnnnnn 396 

<210> 34 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 8, 60, 72, 123, 128, 155, 172, 198, 207, 246, 305, 325, 348, 
349, 369, 371, 380, 393, 394 
<223> n = A,T,C or G 

<400> 34 

acggaccnag ctggaggagc tgggtgtggg gtgcgttggg ctggtgggga ggcctagttn 60 
gggtgcaagt angtctgatt gagcttgtgt tgtgctgaag ggacagccct gggtctaggg 120 
ganagagncc ctgagtgtga gacccacctt ccccngtccc agcccctccc anttccccca 180 
gggacggcca cttcctgntc cccgacncaa ccatggctga agaacaaccg caggtcgaat 240 
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tgttcntgaa ggctggcagt gatggggcca agattgggaa ctgcccattc tcccacagac 300 
tgttnatggt actgtggctc aaggnagtca ccttcaatgt taccaccnnt gacaccaaaa 360 
ggcggaccna nacagtgcan aagctgtgcc canngg 396 

<210> 35 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<400> 35 

tcgaccaaaa 

tccccctcta 

aaagactgta 

gagtcctcca 

gcaccccccg 

cagtgctgac 

gaaggaaatt 



tcaaatctgg 
gaccctgtct 
caaccagttc 
acctcccttt 
ccaccctctg 
ctggcaccca 
attgataccg 



cactcacaag 
tgcaaaatcc 
ctccatttta 
caaccagtcc 
agccagtagt 
agaaaaagcc 
ccgatgagtt 



ccctggccga 
tctccctagc 
tagaagttta 
catcattcca 
gccagcagtg 
caggaagtca 
tgatga 



cccccaatgg 
cagctagtat 
ctcactccag 
accagtggta 
atgatggcca 
agcatgcctg 



gttttaccac 60 
tttctgggct 120 
gggaaatggt 180 
ccatagagca 240 
cccatgagcc 300 
tgaagattga 360 
396 



<210> 36 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<400> 36 

tcgacgggaa gagcctgcta cggtggactg 
gaccccacgc tggaccccct gccggaccct 
cttcctttgg actggactgt ccctgctcat 
gctccaggtt gccacctcct ctcgccagag 
cccatctgcc cacaattcgg gagaccacgg 
agcccccacc ctctcctagc ctggatgact 
ccacctctgt gctggacaag gccacggccc 

<210> 37 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 376 

<223> n = A,T,C or G 



tgagactcag tgcactgtcc tcctcccagc 60 
ccacccttcg gcccccaagc ttcccagggg 120 
ccattctcct gccaccccca gacctcctca 180 
tgatgaggtc ccggcttctg ctctccgtgg 240 
aggagatgct gcttgggggt cctggacagg 300 
acgtgaggtc tatatctcga ctggcacagc 360 
agggcc 396 



<400> 37 

cgacggtgtc agcaactggc catgccacag 
ttgtttgagg attcctttca acagataatg 
ttgtgggtga tttttccagg agaagaaggt 
ttctttcttt tgtcacatga agtgttgaac 
aaggataact actgcttgca gataaacccc 
tattttcgtt ttattggcag atttattgcc 
cacgggtttt tctttnccat tctataagcg 

<210> 38 
<211> 396 
<212> DNA 

<213> Homo sapiens 



cacataaaga ttacagtgac aagaaaaaca 60 
agcttcagtc cccaagatct gcgaagacgt 120 
ttagattatg gaggtgtagc aagagaatgg 180 
ccaatgtatt gcctgtttga .atatgcaggg 240 
gcttcttaca tcaatccaga tcacctgaaa 300 
atggctctgt tccatgggaa aattcataga 360 
tatctt 396 



<400> 38 

cgaccaaaat gataaatagc tttaagaatg tgctaatgat aaatgattac atgtcaattt 60 
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aatgtactta atgtttaata ccttatttga ataattacct gaagaatata ttttttagta 120 

ctgcatttca ttgattctaa gttgcacttt ttacccccat actgttaaca tatctgaaat 180 

cagaatgtgt cttacaatca gtgatcgttt aacattgtga caaagtttaa tggacagttt 24 0 

tttcccatat gtatatataa aataatgtgt tttacaatca gtggcttaga ttcagtgaaa 300 

tacagtaatt cattcaatta tgatagtatc tttacagaca ttttaaaaat aagttatttt 360 

tatatgctaa tattctatgt tcaagtggaa tttgga 396 

' <210> 39 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<400> 39 

tcgaccaaga atagatgctg 
ccaaccctca gagccacccc 
ggctgccctg gtgctgccac 
agaacccttg gagccaatgg 
acagggcaag tgggacaggg 
ctggctgcct tagaaccttt 
ctttgggtct tgtttttttt 

<210> 40 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 200, 375 
<223> n = A,T,C or G 

<400> 40 

tttttttttt ttttgttatt tagtttttat ttcataatca taaacttaac tctgcaatcc 60 

agctaggcat gggagggaac aaggaaaaca tggaacccaa agggaactgc agcgagagca 120 

caaagattct aggatactgc gagcaaatgg ggtggagggg tgctctcctg agctacagaa 180 

ggaatgatct ggtggttaan ataaaacaca agtcaaactt attcgagttg tccacagtca 240 

gcaatggtga tcttcttgct ggtcttgcca ttcctggacc caaagcgctc catggcctcc 300 

acaatattca tgccttcttt cactttgcca aacaccacat gcttgccatc caaccactca 360 

gtcttggcag tgcanatgaa aaactgggaa ccattt 396 

■ <210> 41 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 288 

<223> n = A,T,C or G 
<400> 41 

tcgacctctt gtgtagtcac ttctgattct gacaatcaat caatcaatgg cctagagcac 60 
tgactgttaa cacaaacgtc actagcaaag tagcaacagc tttaagtcta aatacaaagc 120 
tgttctgtgt gagaattttt taaaaggcta cttgtataat aacccttgtc atttttaatg 180 
tacaaaacgc tattaagtgg cttagaattt gaacatttgt ggtctttatt tactttgctt 240 
cgtgtgtggg caaagcaaca tcttccctaa atatatatta cccaaagnaa aagcaagaag 300 
ccagattagg tttttgacaa aacaaacagg ccaaaagggg gctgacctgg agcagagcat 360 
ggtgagaggc aaggcatgag agggcaagtt tgttgt 396 



actgtactcc tcccaggcgc 
taaagagata ctttgatatt 
acttcaggct cttctccttt 
agactgtctc aagagggcac 
catggccagg tggccactcc 
cttacattag cagtttgctt 
ttccacttag aaattg 



cccttccccc tccaatccca 60 
ttcaacgcag ccctgctttg 120 
cacaaccttc tgtggctcac 180 
tggtggcccg acagcctggc 240 
agacccctgg cttttcactg 300 
tgtatgcact ttgttttttt 360 

396 
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<210> 42 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<222> 65, 68, 69, 71, 72, 75, 77, 79, 82, 85, 86, 87, 89, 90, 97, 
98, 105, 107, 109, 112, 117, 121, 122, 124, 126, 149, 152, 
153, 155, 157, 161, 163, 167, 168, 169, 174, 177, 178, 179, 
180, 186, 188, 192, 201, 202, 207, 208, 215, 217, 220 
<223> n A,T,C or G 

<221> misc_feature 

<222> 225, 230, 242, 243, 247, 250, 259, 263, 271, 272, 279, 284, 
295, 298, 299, 308, 309, 312, 323, 342, 348, 351, 363, 366, 
370, 386, 390, 392 
<223> n = A, T,C or G 



<400> 42 

cttttttttt tttttttttt tttttttttt 
aaaanccnna nnaananang gnaannnann 
nntntnaggg ggggggttca aaaccaaang 
99999nanaa anaaaaaggg nngaaanntg 
cnngggntan aaaaaaagnt ganccctaaa 
caaatccnnt gngggttggg ggnggggaaa 
ggnttnaaan atttgggttc gggggntttn 



tttttttttt tttttttttt tttttttttt 60 
aaaaaannca aaccncntnt anaaaangcc 120 
gnngntngga ngnaaannna aaanttnnnn 180 
acccnanaan gaccngaaan cccgggaaac 240 
nncccccgna aaanggggga agggnaannc 300 
aaaaaaaccc cnaaaaantg naaaaaaccg 360 
tnttaa 396 



<210> 43 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^f eature 

<222> 108, 195, 213, 279, 287, 349 

<223> n = A,T,C or G 



<400> 43 

tttttttttt 

tcattgaggc 

tgcactttaa 

ctgtgctttt 

cagggtacag 

gtaagaaact 

gccaaggttt 



ttttgcttca 
ttctgttaaa 
gatgaactaa 
cttanactaa 
taacatagta 
agcccattat 
ccaaacacaa 



ctgctttatt 
agttcttcca 
cttttgggat 
aagcatactg 
aagtacctgc 
ttaagtgtcc 
tcttgttctc 



tttgaaatca 
aagttgccca 
tctcttcaaa 
canaaaactc 
ctattttana 
acagtatttt 
taatac 



caagcaattc 
gttttaanat 

gaaggaaagt 
tattttaaaa 
atcctanaga 
tcatttcant 



aaagtgatca 60 
taaacaatat 120 
attgctccat 180 
atcaacactg 240 
acatttcatt 300 
ggtccaagat 360 
396 



<210> 44 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<400> 44 

gacctagttt tacctcttaa atatctctgt tcccttctaa gttgtttgct gtgttttctt 60 
cagagcaaga aggttatatt ttttaaaatt tacttagtaa tgcacattca aaacacacat 120 
caagtcttca ggataaagtt caaaaccgct gtcatggccc catgtgatct ctccctcccc 180 
tacccctcta tcatttagtt tcttctgcgc aagccactct ggcttccttt cagttttgtg 240 
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gttcccgttt ttagctagtt cagtggtttt caatgggcat ttcttgcctt tttttttcta 300 
aacgacaaat agaaatacat cttctttatt atcctccaaa tccaattcag aggtaatatg 360 
ctccacctac acacaatttt agaaataaat taaaaa 396 

<210> 45 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 18, 19, 22, 39, 40, 43, 62, 84, 90, 99, 103, 104, 105, 117, 
120, 123, 128, 134, 139, 141, 142, 143, 144, 145, 182, 187, 
207, 218, 219, 242, 247, 257, 260, 263, 272, 276, 277, 279, 
284, 288, 294, 296, 297, 305, 310, 314, 319, 320, 322 
<223> n = A, T,C or G 



<221> misc_feature 

<222> 364, 366, 376, 378, 381, 387, 388, 396 
<223> n = A, T,C or G 



<400> 45 

tttttttttt ttttaaannt tntaaatttt taatgaaann ganttagaac aatgtattat 60 
tnacatgtaa ataaaaaaag agancataan ccccatatnc tcnnnaaagg aaggganacn 120 
gcnggccntt tatnagaana nnnnncatat aagaccccat taagaagaat ctggatctaa 180 
anacttncaa acaggagttc acagtangtg aacagcannc cctaatccca ctgatgtgat 240 
gnttcanata aaatcancan cgntgatcgg gnatcnnanc aatntgancg gaanannact 300 
gctcnatatn tttnaggann cngatgtggt cattttttac aaagataatg gccacaccct 360 
tccngnccga atcgancnga nctcccnntt ctgtgn 396 

<210> 46 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 24, 105, 144, 188, 190, 214, 317, 369, 371, 378 
<223> n = A,T,C or G 



<400> 46 

tttttttttt tttttttttc 
tggtgccatc tcggctcact 
agcctcccgg gtagctggga 
tttagtanan atggcgtttc 
atccacccac ctcggcctcc 
cctgaaatat ctatttnttt 
gtgagctana naaagtgngt 

<210> 47 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 
<222> 290 

<223> n = A,T,C or G 



tganacagag tctcattctg 
gcaacctccg- cctcctgggt 
ctanaggcac acgccaccac 
accatgttga ccanactgat 
caaagtgctg ggattacagg 
tcagattatt tttaaaattc 
gtgtacatgc acacac 



ttgcctaggc tggattgcag 60 
tccanaaatt ctcctgcctc 120 
gccaggctaa tttttatatt 180 
ctcgaactcc cgacctcgtg 240 
cgtgaaacca ccaggcccgg 300 
catttgatga atcttttaaa 360 

396 
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<400> 47 

tttttttttt tttttttgct gttgccaact gtttattcag ggccctgaac gggtggtgcg 60 
tggacatgca acacactcgg gcccacagca gcgtgaccgg ccgctcccaa gccccgggcg 120 
cacaaccaca gccaggagca gcccctgcca ccactgggcc accgtccagg gccccacagg 180 
accagccgaa ggtgccccgg gccgaggcca gctgggtcag gtgtacccct agcctggggt 240 
tgagtgagga gcggcacccc cagtatcctg tgtaccccaa gttgcccagn aggccgaggg 300 
.ggccttgggc tccatctgca ctggccaccc cgtgccaagc atcacagctg cgtgagcagg 360 
tttgtgtgtg agcgtgtggc ggggcctggt tgtccc 396 

<210> 48 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 393, 396 
<223> n = A,T,C or G 

<400> 48 

ctgggcctgt gccgaagggt ctgggcagat cttccaaaga tgtacaaaat gtagaaattg 60 

ccctcaagca aatgcaaaga tgctcaacac ccttagtcat caagaaaatg caaatggaat 120 

ccacagagag atactgcaca ctgacaaaga tggtcgtatt actaaaggtg aataaccagc 180 

gcggggggca cgtggagtca ctggaacatt tgtgcaatgc tggtgggaat gtcaacccgt 240 

gcggccctct ggaataagcc tggcagctcc tccaagagtt acccgtgtga cccagcaatt 300 

ccactcctag ctccacccac aggaattgaa agcaaagacg caaacagatg cctgtgcacc 360 

aaagttcacg gcagcatcct tcgccatagt ggnaan 396 

<210> 49 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 32, 40, 44, 64, 70, 83, 87, 92, 104, 115, 118, 125, 127, 
130, 137, 155, 168, 171, 173, 175, 192, 201, 206, 208, 218, 
219, 235, 247, 249, 256, 259, 260, 269, 297, 306, 310, 320, 
321, 328, 331, 345, 356, 381, 389, 395 
<223> n = A,T,C or G 

<400> 49 

accccaaaat gggaaaggaa aagactcata tnaacattgn cgtnattgga cacgtacatt 60 
cggncaagtn caccactact ggncatntga tntataaatg cggnggcatc gacanaanaa 120 
ccatngnaan atttganaag gaggctgctg atatnggaaa gggctccntc nantntgcct 180 
gggtcttgga tnaactgaaa nctgancntg aacgtggnnt caccattgat atctncttgt 240 
ggaaatntna gaccancann tactatgtna ctatcattga tgccccagga cacaganact 300 
ttatcnaaan catgattacn nggacatnta nagctgactg tgctngcctg attgtngctg 360 
ctggtgttgg tgaatttgaa nctggtatnt ccaana 396 

<210>- 50 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<400> 50 

cgacttcttg ctggtgggtg gggcagtttg gtttagtgtt atactttggt ctaagtattt 60 
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gagttaaact gcttttttgc taatgagtgg 

tgttgattgt tactagtggc attaactttt 

taatatccca gctagagata tggcctttaa 

tttttcccgt tcctttttct tcagtaaacc 

gatgtcctta taggtcacta cccctaaata 

actaaaaaat acctaaaagg aagcttagat 

<210> 51 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 18, 52, 59, 148, 267, 321, 

<223> n = A, T,C or G 



gctggttgtt agcaggtttg tttttcctgc 120 
agaatttggg ctggtgagat taattttttt 180 
ctgacctaaa gaggtgtgtt gtgatttaat 24 0 
caacaatagt ctaaccttaa aaattgagtt 300 
aacctgaagc aggtgttttc tcttggacat 360 
gggctg 396 



<400> 51 

tttttttttt ttcagcgngg atttatttta tttcattttt tactctcaag anaaagaana 60 

gttactattg caggaacaga cattttttta aaaagcgaaa ctcctgacac ccttaaaaca 120 

gaaaacattg ttattcacat aataatgngg ggctctgtct ctgccgacag gggctgggtt 180 

cgggcattag ctgtgccgtc gacaatagcc ccattcaccc cattcataaa tgctgctgct 240 

acaggaaggg aacagcggct ctcccanaga gggatccacc ctggaacacg agtcacctcc 300 

aaagagctgc gactgtttga naatctgcca anaggaaaac cactcaatgg gacctggata 360 

acccaggccc gggagtcata gcaggatgtg gtactt 396 

<210> 52 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 81, 189 

<223> n » A,T,C or G 

■<400> 52 

acctcgctaa gtgttcgcta cgcggggcta ccggatcggt cggaaatggc agaggtggag 60 
gagacactga agcgactgca nagccagaag ggagtgcagg gaatcatcgt cgtgaacaca 120 
gaaggcattc ccatcaagag caccatggac aaccccacca ccacccagta tgccagcctc 180 
atgcacagnt tcatcctgaa ggcacggagc accgtgcgtg acatcgaccc ccagaacgat 240 
ctcaccttcc ttcgaattcg ctccaagaaa aatgaaatta tggttgcacc agataaagac 300 
tatttcctga ttgtgattca gaatccaacc gaataagcca ctctcttggc tccctgtgtc 360 
attccttaat ttaatgcccc ccaagaatgt taatgt 396 

<210> 53 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 224, 225, 228, 235, 240, 246, 257, 266, 274, 279, 281, 282, 
283, 285, 287, 288, 290, 291, 292, 293, 294, 295, 296, 297, 
300, 301, 303, 307, 311, 313, 314, 317, 318, 319, 320, 321, 
323, 324, 328, 329, 330, 336, 337, 338, 339, 340, 341 
<223> n « A,T,C or G 
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<221> misc_feature 

<222> 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 356, 
357, 358, 359, 362, 363, 364, 365, 366, 367, 373, 380, 381, 
382, 385, 387, 388, 389, 390, 392 
<223> n = A,T,C or G 



<400> 53 

tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 
tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 120 
tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 180 
tttttttttt tttttttttt tttttttttt tttttttttt ttannttntt ttttnttttn 240 
cctttntttt aattcanaaa aagaanaaga aaanataana nnnancnnan nnnnnnnatn 300 
ntncttnata ntnnttnnnn nanngggnnn gcgagnnnnn nnnnnnnnnn nntctnnnnt 360 
tnnnnnnctt gcnccccttn nnttngnnnn angcaa 396 



<210> 54 
<211> 396 
<212> DNA 
<213> Homo 

<220> 

<221> misc 
<222> 367 " 
<223> n = J 



sapiens 

feature 
i,T,C or G 



<400> 54 

ctcttggggc tgctgggact cgcgtcggtt 
gccgggttct gaggccttgc ttctctttac 
ccagacctgg gaggagttca gccgcgctgc 
ggcacgtgtg gttctcaaat ataggcattc 
tgatttagtt tgtttggtgt ataaaacaga 
attccacagt caactaatgc gacttatggt 
aactgantga atggtttgaa atgaagactt 

<210> 55 

<211> 396 

<212> DNA 

<213> Homo sapiens 



ggcgactccc ggacgtaggt agtttgttgg 60 
ttttccactc taggccacga tgccgcagta 120 
cgagaagctt tacctcgctg accctatgaa 180 
tgatgggaac ttgtgtgtta aagtaacaga 24 0 
ccaagctcaa gatgtaaaga agattgagaa 300 
agccaaggaa gcccgcaatg ttaccatgga 360 
tgtcgt 396 



<400> 55 

cgacggtttg ccgccagaac acaggtgtcg 
aaggaaaaga ctcatatcaa cattgtcgtc 
actactggcc atctgatcta taaatgcggt 
gagaaggagg ctgctgagat gggaaagggc 
ctgaaagctg agcgtgaacg tggtatcacc 
agcaagtact atgtgactat cattgatgcc 
attacaggga catctcaggc tgactgtgct 

<210> 56 

<211> 396 

<212> DNA 

<213> Homo sapiens 



tgaaaactac ccctaaaagc caaaatggga 60 
attggacacg tagattcggg caagtccacc 120 
ggcatcgaca aaagaaccat tgaaaaattt 180 
tccttcaagt atgcctgggt cttggataaa 240 
attgatatct ccttgtggaa atttgagacc 300 
ccaggacaca gagactttat caaaaacatg 360 
gtcctg 396 



<220> 

<221> misc_feature 

<222> 134, 145, 255, 279, 337, 344, 369 
<223> n = A, T,C or G 
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<400> 56 

tttttttttt ttttttctca tttaactttt ttaatgggtc tcaaaattct gtgacaaatt 60 
tttggtcaag ttgtttccat taaaaagtac tgattttaaa aactaataac ttaaaactgc 120 
cacacgcaaa aaanaaaacc aaagnggtcc acaaaacatt ctcctttcct tctgaaggtt 180 
ttacgatgca ttgttatcat taaccagtct tttactacta aacttaaatg gccaattgaa 240 
acaaacagtt ctganaccgt tcttccacca ctgattaana gtggggtggc aggtattagg 300 
gataatattc atttagcctt ctgagctttc tgggcanact tggngacctt gccagctcca 360 
gcagccttnt tgtccactgc tttgatgaca cccacc 396 

<210> 57 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 52, 57, 58, 61, 72, 75, 77, 84, 87, 88, 93, 100, 101, ill, 
117, 119, 121, 131, i32, 133, 134, 142, 143, 154, 156, 159, 
167, 168, 170, 175, 176, 182, 183, 185, 186, 190, 192, 194, 
198, 199, 200, 209, 212, 217, 218, 220, 232, 235, 253 
<223> n = A,T,C or G 



<221> inisc_f eature 

<222> 255, 257, 258, 260, 262, 263, 270, 271, 273, 277, 280, 281, 
284, 285, 289, 296, 297, 298, 303, 305, 307, 309, 310, 317, 
322, 324, 337, 338, 342, 344, 346, 347, 349, 351, 356, 358, 
366, 368, 371, 377, 380, 388, 389, 393, 396 
<223> n = A,T,C or G 



<400> 57 

cctttttttt tttttttttt tttttttttt 
ntttttgcaa anccnancaa aaanggnngg 
ntttgggaac nnnnagccct tnntttgaaa 
gnnanncccn gntncttnnn tttaaaaana 
ttttttccct aanancnncn anntgaaacn 
atnanangnn aaaaaanccc tnancccccc 
ttgggncntg naaaaanaan cctttttnnt 

<210> 58 
<211> 396 
<212> DNA 

<213> Homo sapiens 



tttttttttt tttttttttt tnaaaanntt 60 
aangaaaaan nggaaaaatt ntttttncnt 120 
aaangnggnc ttaaaanngn tgaannaaag 180 
anggggnngn ttttttttaa anaanatttt 240 
ngncccnacn nctnncttna aagggnnnaa 300 
cccttanntt tncnannana naaagncntt 360 
gcnttn 396 



<400> 58 ' 

cgacctcaaa tatgccttat tttgcacaaa 
tacagcctcg atttatattt ctgtttgtgg 
aaagaggttt ttgaaatgcc tatggtttct 
ttccagtagt cagcaaagag cagtttgaat 
agactcgagc acagcaccca gacttcatgc 
aagcggccga ccactgactt tgtgacttag 
ttcaccccca ctccccgtac agtgcgcaca 

<210> 59 
<211> 396 
<212> DNA 

<213> Homo sapiens 



agactgccaa ggacatgacc agcagctggc 60 
tgaactgatt ttttttaaac caaagtttag 120 
ttgaatggta aacttgagca tcttttcact 180 
tttcttgtcg cttcctatca aaatattcag 24 0 
gcccgtggaa tgctcaccac atgttggtcg 300 
gcggctgtgt tgcctatgta gagaacacgc 360 
ggcttt 396 



<220> 
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<221> raisc__feature 

<222> 25, 45, 116, 178, 198, 211, 225, 235, 253, 266, 281, 324, 

367, 377, 389 

<223> n = A,T,C or G 



<400> 59 

cttttttttt tttttttttt tcagnggaaa 

aaatctgttc ctggcattaa gctccttctt 

ccatgaatgc tttcttctcc tccatggtct 

tgtcaatgaa cttaaggnca atcttctcca 

acttgcggag ggngagcacc cgcttnttgg 

agtcattggt cacttcacca tagnggacaa 

aataggncat agtcacngga ggcattgtnc 

<210> 60 

<211> 396 

<212> DNA 

<213> Homo sapiens 



ataactttta ttganacccc accaactgca 60 

cctttgcaat tcggtctttc ttcagnggtc 120 

ggaagcggcc atggccaaac ttggaggngg 180 

nagcccgccg cttcntctgc accancaagg 240 

ttcccaccac ncagcctttc agcatgacaa 300 

agccacccaa agggttgatg ctccttggca 360 
ttgatc 396 



<40(3> 60 

acctcagctc tcggcgcacg gcccagcttc 
gtgcaagctc agcttggagg gtgatcactc 
agcctatact aactttgatg ctgagcggga 
caaaggtgtg gatgaggtca ccattgtcaa 
acaggatatt gccttcgcct accagagaag 
gtcagcctta tctggccacc tggagacggt 
gtatgacgct tctgagctaa aagcttccat 

<210> 61 

<211> 396 

<212> DNA 

<213> Homo sapiens 



cttcaaaatg tctactgttc acgaaatcct 60 
tacaccccca agtgcatatg ggtctgtcaa 120 
tgctttgaac attgaaacag ccatcaagac 180 
cattttgacc aaccgcagca atgcacagag 240 
gaccaaaaag gaacttgcat cagcactgaa 300 
gattttgggc ctattgaaga cacctgctca 360 
gaaggg 396 



<400> 61 

tagcttgtcg gggacggtaa ccgggacccg 
tccctagcca ctatgcgtga gtgcatctcc 
ggcaatgcct gctgggagct ctactgcctg 
ccaagtgaca agaccattgg gggaggagat 
ggcgctggca agcacgtgcc ccgggctgtg 
gaagttcgca ctggcaccta ccgccagctc 
gaagatgctg ccaataacta tgcccgdggg 

<210> 62 

<211> 396 

<212> DNA 

<213> Homo sapiens 



gtgtctgctc ctgtcgcctt cgcctcctaa 60 
atccacgttg gccaggctgg tgtccagatt 120 
gaacacggca tccagcccga tggccagatg 180 
gactccttca acaccttctt cagtgagacg 240 
tttgtagact tggaacccac agtcattgat 300 
ttccaccctg agcagctcat cacaggcaag 360 
cactac 396 



<220> 

<221> misc_feature 

<222> 261, 269, 313, 333, 346, 354, 359, 390, 394, 395, 396 
<223> n = A,T,C or G 



<400> 62 

tcgacgtttc ctaaagaaaa ccactctttg 

actctgtaac atctttgtgg tagtcctgtt 

agattacaga tttgaacatg tagtgtacgt 

gtaacagctg accaacgtga agatactggt 

ccaaatttta agctggaaag ncactggant 



atcatggctc tctctgccag aattgtgtgc 60 

ttcctaataa ctttgttact gtgctgtgaa 120 

gctgttgagt tgtgaactgg tgggccgtat 180 

acttgatagc ctcttaagga aaatttgctt 240 

aactttaaaa aagaattaca atacatggct 300 



wo 02/39885 



22 



PCT/USOl/45395 



ttttagaatt tcnttacgta tgttaagatt tgngtacaaa ttgaantgtc tgtnctganc 360 
ctcaaccaat aaaatctcag tttatgaaan aaannn 396 

<210> 63 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^f eature 

<222> 3, 11, 16, 18, 23, 26, 30, 34, 37, 50, 51, 60, 61, 62, 63, 
64, 75, 82, 83, 84, 85, 87, 89, 93, 94, 97, 98, 99, 118, 
119, 120, 122, 134, 136, 138, 139, 141, 144, 145, 147, 152, 
156, 187, 188, 193, 195, 204, 211, 214, 216, 222, 226 
<223> n « A,T,C or G 

<221> misc^f eature 

<222> 228, 235, 242, 258, 264, 265, 269, 275, 294, 298, 301, 307., 

316, 326, 334, 335, 339, 340, 343, 350, 351, 355, 373, 378, 

390 

<223> n » A,T,C or G 
<400> 63 

ttnttttttt nttttntntt ttntcnttgn ttgnacngaa cccggcgctn nttccccacn 60 
nnnnacggcc gcccntattc annnntncnt canntannna ccgcaccctc ggactgcnnn 120 
tngggccccg ccgncnannc nccnncnccc anttcnccgc cgccgccgcc gccttttttt 180 
attggcnncc atnanaaccg gggncacctc ncangngcgc cnaaantngg ggcangactc 240 
anagggggcc atcaaccncc aagnncaanc tgganctcta caaacggcct acgntttntg 300 
nccatgnggg tagggnttta cccgcnatga tgannatgnn aanaactttn ncaanccctt 360 
tattaaccaa tgnggtgngg agacggaacn tggtta 396 



<210> 64 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 175, 177, .340, 393 
<223> n A, T,C or G 



<400> 64 

tcgacgtcgg ggtttcctgc ttcaacagtg cttggacgga acccggcgct cgttccccac 60 
cccggccggc cgcccatagc cagccctccg tcacctcttc accgcaccct cggactgccc 120 
caaggccccc gccgccgctc cagcgccgcg cagccaccgc cgccgccgcc gcctntnctt 180 
agtcgccgcc atgacgaccg cgtccacctc gcaggtgcgc cagaactacc accaggactc 240 
agaggccgcc atcaaccgcc agatcaacct ggagctctac gcctcctacg tttacctgtc 300 
catgtcttac tactttgacc gcgatgatgt ggctttgaan aactttgcca aatactttct 360 
tcccaatctc atgaggagaa ggaacatgct ganaaa 396 

<210> 65 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 26, 56, 103, 122, 145, 151, 154, 187, 189, 203, 224, 256, 
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273, 305, 344 

<223> n = A,T,C or G 

<400> 65 

tttttttttt tttttttttt tttttnacca 
atttatatgt tagttttagt acaagtacta 
gnatacataa gcttcaccat acatnttgca 
aatgtanana ttttaggaaa ctnttttatt 
gcactgacct gatgtnttat ttaaaagtaa 
tattnaaacg tatttccatg ttatccagct 
tctgagtcaa atctggactg aaatcctgat 

<210> 66 
<211> 396 

<212> DNA 

<213> Homo sapiens 



ataatgcttt tattttccac atcaanatta 60 
aaatgtatac ttnttgccct aatagctaag 120 
nccncctgtc tgtcctatgt cattgttata 180 
caacctggga catntatact gtaggagtta 240 
tgnatattac ctttacatat attccttata 300 
taaaatcaca tggnggttaa aagcatgagt 360 
gctccc 396 



<400> 66 

tcgacttttt tttttccagg 
atataagtta caacttgatt 
ttcttttatg tacaatatct 
tttctgctta atataacaag 
aaaccccccg ctcaactaca 
gtaacttcag cttttatcta 
aacaacaaac aacaaacaaa 

<210> 67 
<211> 396 

<212> DNA 

<213> Homo sapiens 



acattgtcat aattttttat 
aaagttgata gacatttgta 
tgtctagagt ctagcaaata 

caaaaacaaa caactgaaaa 
aatgtcaata ttgaatgaag 
gcaatgcaga atgaatacta 
acaaaacaaa caaaca 



tatgtatcaa attgtcttca 60 
tctatttaaa gacaaaaaaa 120 
tagtaccttt cattgcagga 180 
aatataaacc aaagcaaacc 24 0 
cattaaaaga caaacataaa 300 
aaattagtgg caaaaaaaca 360 

396 



<400> 67 

acgcttttgt ccttcatttt aactgttatg 
aagagaatag aggcaaaagt atagaactga 
tgaaacttca ccatattatg atcatacata 
tgttttttgt gttaggtgct ttggctgaac 
tgtgtatgtc tcttcatttc ctctcaaatc 
caggatgctg gcatctgtgt agtcctcata 
catggcagac ctaagctcag accctgcctt 

<210> 68 

<211> 396 

<212> DNA 

<213> Homo sapiens 



tcatactgtt atgttgacat atttctttat 60 
ggatcatttg tatttttgag ttggaaatta 120 
ttttgaagaa cagactgacc aaagctcacc 180 
ttgattccag cccccttttc cctttggtgt 240 
ttcaactctt gccccatgtc tccttggcag 300 
ctgtttactg ataacccaca aattcatttt 360 
gtcctg 396 



<400> 68 

acctgagtcc tgtcctttct ctctccccgg 
tctccaccaa ctaccggtcc ctgggctctg 
tcagcagcgc ggccagcgtc tatgcaggcg 
cccgctccac cagcttcagg ggcggcatgg 
ggggtctggc aggaatggga ggcatccaga 
accgcctggc ctcttacctg gacagagtga 
agagcaaaat ccgggagcac ttggagaaga 

<210> 69 • 
<211> 396 

<212> DNA 

<213> Homo sapiens 



acagcatgag cttcaccact cgctccacct 60 

tccaggcgcc cagctacggc gcccggccgg 120 

ctgggggctc tggttcccgg atctccgtgt 180 

ggtccggggg cctggccacc gggatagccg 240 

acgagaagga gaccatgcaa agcctgaacg 300 

ggagcctgga gaccgagaac cggaggctgg 360 
agggac 396 
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<220> 

<221> misc^feature 

<222> 1, A, S, B, 9, 11, 18, 19, 36, 53, 60, 64, 79, 84, 92, 94, 
97, 105, 114, 120, 123, 127, 129, 134, 137, 138, 139, 142, 
143, 147, 149, 151, 152, 156, 158, 167, 170, 172, 180, 182, 
184, 187, 188, 189, 194, 197, 201, 209, 212, 218, 219 
<223> n = A,T,C or G 

<221> inis cofeature 

<222> 220, 222, 223, 225, 228, 229, 230, 232, 233, 236, 242, 244, 
247, 250, 251, 253, 256, 257, 259, 261, 270, 271, 274, 277, 
278, 279, 282, 284, 28.8, 289, 296, 298, 300, 310, 315, 316, 
320, 321, 324, 328, 330, 331, 334, 336, 340, 347, 350 
<223> n = A,T,C or G 

<221> inisc_feature 

<222> 352, 353, 355, 359, 361, 362, 364, 367, 370, 372, 374, 376, 
382, 388, 390, 394, 396 
<223> n = A,T,C or G 

<400> 69 

ntcncngnng ntgtggtnnt ttttttaatt tttatntttt cttttttttt ctngctagcn 60 

cttncttttt ttggaattnc ggtncctttt tntntcnatt ttttngacaa aaanaacctn 120 

ttntttnana ccanagnnng gnncacncnt nnaatntncc ccttttncgn tngggagctn 180 

cncnttnnnc gccnacntca ntcgagacng tncttttnnn tnnancannn tnngtncgtt 240 

gncngcnttn ntncannant nttccctatn nacntgnnnt cncncatnnt tggacnancn 300 

cctagccttn ccatnntttn nttntttntn natnancctn gaaaacntcn gnntnttcnc 360 

nncnttnccn cncncncctt cntatgtncn atgncn 396 

<210> 70 

<211> 396 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 15, 38, 57, 59, 63, 64, 65, 66, 68, 78, 79, 84, 87, 90, 97, 
114, 115, 127, 128, 141, 143, 145, 151, 159, 168, 169, 172, 
173, 176, 178, 197, 198, 207, 209, 211, 215, 220, 221, 223, 
225, 228, 240, 248, 249, 260, 262, 263, 273, 283, 287 
<223> n = A,T,C or G 

<221> misc^f eature 

<222> 294, 304, 314, 334, 339, 340, 348, 362, 367, 376, 382, 384, 
386, 395 

<223> n = A,T,C or G 
<400> 70 

tttttttttt ttttnttttt tttttttttt tttttttntt tttttttttt ttttttntnc 60 
aannnntnaa cttttaanng gccnccngcn ccccaanggg gaccctgctt ttgnnggcta 120 
aatgccnnaa aactttgggg nantnggtat naaaccccnc tttgcccnnc annttncngg 180 
gggggggggg tttttgnngg ggaacangna naacnttttn ncnanggnat caccaaaaan 240 
aaagcccnnc cctttttccn annggggggg ggngggggga aantcanccc ccanattgac 300 
cttnatttca aaanggggct tataatcctg ggcntggann cttccctnta cccgggggtt 360 
gnccacnttt tattanaggg gnangnggat ccccnt 396 



<210> 71 
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<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc_feature 

<222> 15, 21, 30, 33, 35, 36, 42, 43, 44, 45, 46, 51, 56, 58, 59, 
63, 70, 77, 81, 88, 94, 95, 96, 97, 101, 102, 109, 114, 
118, 119, 120, 124, 131, 132, 133, 134, 135, 141, 142, 143, 
144, 145, 146, 148, 149, 154, 158, 162, 164, 166, 172 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 177, 179, 181, 184, 185, 213, 216, 218, 219, 222, 223, 224, 
230, 231, 240, 241, 242, 245, 247, 251, 252, 255, 258, 259, 
261, 264, 268, 269, 272, 276, 285, 288, 289, 291, 292, 293, 
297, 299, 300, 307, 312, 315, 316, 317, 325, 329, 334 
<223> n « A,T,C or G 

<221> misc_feature 

<222> 340, 341, 347, 350, 354, 355, 357, 360, 361, 367, 368, 370, 
371, 376, 377, 378, 387, 393, 394 
<223> n = A,T,C or G 

<400> 71 

gcatctagag ggccngttta 
gcncctgctn gttgccnccc 
gaangtgcct nnnnnactga 
natnnggggt gatgctattc 
nnatnangag nnatntcnng 
ggactcntcg cncannnatc 
ngtactnnan ngttgnnntc 



ntctagaggn ccngnntaaa 
ntctgtgnct tgcnnnnccc 
nnnnnncnna taanatgngg 
tggggggtgg ggnggngnna 
nttntctnnt gntttntggg 
aatancttna ttcngtgtan 
attactnttc gtnngg 



cnnnnncatc nacctncnnt 60 
nngagcgtnc cttnaccnnn 120 
anantncgtc gncattntnt 180 
tnnnatactn nggggacgtn 240 
gggcnatnng nnntctntnn 300 
ngtccgnccn tagnncngcn 360 

396 



<210> 72 
<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

.<222> 2, 23, 27, 34, 35, 36, 37, 39, 41, 45, 55, 56, 59, 61, 88, 
92, 96, 97, 98, 101, 103, 104, 106, 108, 111, 114, 115, 
121, 128, 129, 131, 159, 170, 191, 202, 227, 233, 235, 240, 
262, 268, 271, 272, 280, 281, 303, 304, 305, 311, 316, 317 
<223> n = A,T,C or G 

<221> masc_feature 

<222> 321, 324, 336, 344, 345, 353, 360, 362, 363, 364, 365, 366, 
370, 373, 389, 391, 392, 394, 395 
<223> n = A, T,C or G 

<400> 72 

tntttttttt tttctaaaac atnactnttt attnnnnang ntttntgaac ctctnngcnt 60 
natggtgaga gtttgtctga ttaataanaa tnggannntt nannanangc ntgnncgcaa 120 
ngatggcnnc nctgtatatc ccaccatccc attacactnt gaaccttttn tttgattaat 180 
aaaaggaagg natgcgggga anggggaaag agaatgcttg aacattncca tgngnccttn 240 
gacaaacttt ccaatggagg cnggaacnaa nnaccaccan ncaactcccc tttttgtaat 300 
ttnnnaactt ncaacnncta nctntttatt ttggcntccc tggnngaaac agnctgtatn 360 
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annnnnaagn ccntgagaac atccctggnt nncnna 396 

<210> 73 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1, 1, 9, 14, 23, 35, 38, 44, 48, 50, 61, 74, 76, 79, 80, 

85, 86, 91, 95, 101, 109, 112, 113, 117, 118, 121, 122, 

127, 129, 132, 137, 141, 146, 214, 234, 243, 251, 266, 296, 

305, 306, 336 

<223> n = A,T,C or G 



<400> 73 

ntcaacntng actnctgtga ggnatggtgc 
naccttatgg acantngcnn tcccnnggaa 
nnttccntnt cnaaaangtt naaaancatt 
ctactcttga gtgctataat gatgaagatc 
aanatggtga nttggggaac atatanagta 
caggnnctaa agagcaagcc taactcaagc 
tgtacttctc acttttaagg gattagaaaa 

<210> 74 

<211> 396 

<212> DNA 

<213> Homo sapiens 



tgggngcnta tgcngtgngn ttttggatac 60 
ngatnataat ncttactgna gnnactnnaa 120 
ggatgtgcca .caatgatgac agtttatttg 180 
ttanccacca ttatcttaac tgangcaccc 240 
cacctaagtt cacatgaagt tgtttnttcc 300 
cattgncaca caggtgagac acctctattt 360 
tagcca 396 



<220> 

<221> misc_feature 

<222> 22, 118 

<223> n = A,T,C or G 

<400> 74 

cctttttttt tttttttact gngaatatat actttttatt tagtcatttt tgtttacaat 60 
tgaaactctg ggaattcaaa attaacatcc ttgcccgtga gcttcttata gacaccanaa 120 
aaagtttcaa ccttgtgttc cacattgttc tgctgtgctt tgtccaaatg aacctttatg 180 
agccggctgc catctagttt gacgcggatt ctcttgccca caatttcgct tgggaagacc 240 
aagtcctcaa ggatggcatc gtgcacagct gtcagagtac ggctcctggg acgcttttgc 300 
ttattttttg tacggctttt tcgagttggc ttaggcagaa ttctcctctg agcgataaag 360 
acgacatgct tcccactgaa ctttttctcc aattcg 396 

<210> 75 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 14, 38, 41, 43, 47, 53, 73, 75, 78, 83, 96, 112, 113, 117, 
124, 127, 146, 160, 167, 169, 176, 177, 178, 179, 194, 197, 
198, 209, 210, 220, 222, 226, 227, 231, 238, 241, 244, 258, 
259, 260, 270, 271, 274, 288, 301, 302, 305, 307, 316 
<223> n = A,T,C or G 



<221> misc_feature 

<222> 319, 328, 339, 344, 347, 354, 359, 364, 367, 369, 370, 371, 
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373, 374, 381, 384, 387, 388 
<223> n = A,T,C or G 



<400> 75 

tttttttttt tttntttttt tttttttttt 
tttttttaaa ctngnccntt ttnctttcct 
ttcnttnaaa aacccttttt cccacnaaca 
aaaaaaaggg gctnggnntt tccccttann 
ntcnccgggg gggaaacnnn tgggggtgtn 
nnccncncct aaagangtnt ttcaaaanaa 
aaanaangnn ngnntttttt ntcnttnncc 

<210> 76 
<211> 396 
<212> DNA 

<213> Homo sapiens 



tttttttnaa ntntaanggg ganggcccct 60 
tttttnaaaa ggaaaaaaaa anntttnttt 120 
aaaaaaaccn ttccccntnc cttttnnnna 180 
caaaaaaccn tntccnnggg naaaaaantt 24 0 
nccnaaattt gggggccntc ggaagggggg 300 
aaacccccnt cctnttntaa aaanaaaana 360 
ccccaa 396 



<220> 

<221> raisc_feature , 

<222> 87, 94, 102, 108, 138, 139, 143, 144, 145, 146, 151, 152, 

158, 168, 170, 171, 187, 204, 206, 224, 261, 262, 267, 268, 

270, 287, 305, 306, 313, 315, 319, 320, 330, 331, 333, 342, 

344, 348, 349, 356, 358, 360, 362, 368, 374, 376, 381 
<223> n = A,T,C or G 



<221> inisc_feature 
<222> 390 

<223> n = A,T,C or G 



<400> 76 

acattcttca gaaatacagt gatgaaaatt cattttgaaa ctcaaatatt 

atattctcct gtttttatta aaccagngat tacncctggc cntccctnta 

gaaggcatgt ctgttgtnnt ttnnnnaaaa nnaaattntt tttttttngn 

atcccanttt atcaggaagt tagncnaatg aaatggaaat tggntaatgg 

gcttgtaaaa aggaccaccc nnccacnngn ctttaccccc ttggttngtt 

ccatnnttaa ccntntggnn aaaattgggn ncntaaagtt tncntggnna 

cngtattnaa ttgncnttat nggaaaatcn gggatt 



ttcattttgg 60 
aatgttctag 120 
naaaccccaa 180 
acaaaagcta 240 
gggggaaaaa 300 
acagtncntn 360 
396 



<210> 77 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 63, 66, 81, 83, 89, 107, 115, 118, 147, 151, 190, 232, 275, 
288, 294, 304, 323, 332, 369, 392 
<223> n « A,T,C or G 

<400> 77 

tttttttttt tttttttttt tttttttttt tatcaacatt tatatgcttt attgaaagtt 60 

ganaanggca acagttaaat ncngggacnc cttacaattg tgtaaanaac atgcncanaa 120 

acatatgcat ataactacta tacaggngat ntgcaaaaac ccctactggg aaatccattt 180 

cattagttan aactgagcat ttttcaaagt attcaaccag ctcaattgaa anacttcagt 240 

gaacaaggat ttacttcagc gtattcagca gctanatttc aaattacnca aagngagtaa 300 

ctgngccaaa ttcttaaaat ttntttaggg gnggtttttg gcatgtacca gtttttatgt 360 

aaatctatnt ataaaagtcc acacctcctc anacag 396 
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<210> 78 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 14, 16, 20, 26, 28, 36, 38, 39, 40, 51, 52, 55, 57, 58, 
67, 71, 114, 120, 132, 138, 142, 159, 165, 169, 172, 174, 
175, 183, 187, 195, 197, 198, 200, 202, 206, 209, 243, 259, 
260, 267, 283, 292, 305, 311, 315, 317, 319, 323, 324 
<223> n = A,T,C or G 

<221> iaisc_feature 

<222> 331, 333, 334, 338, 343, 348, 353, 355, 357, 366, 376, 388 
<223> n * A,T,C or G 

<400> 78 

agctggcnaa aggngnatgn gctgcnangc gattangnnn ggtaacgtca nnggntnncc 60 
agtgcangac nttgtaaaac gacggccaca tgaattgtaa tacgactcac tatngggcgn 120 
attgggccgt gnaggatngt gntcacactc gaatgtatnc tggcngatnc ananngcttt 180 
atngctnttg acggngnntn anccanctng ggctttaggg ggtatcccct cgcccctgct 240 
tcnttgattt gcacgggcnn ctccganttc cttcataata ccngacgctt cnatdcccta 300 
gctcngacct ntcantntnt tcnntgggtt ntnnccgntc acngcttncc cgnangntat 360 
aatctnggct cctttnggga tccattantc tttact 396 

<210> 79 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 116, 153, 189, 194, 210, 218, 241, 270, 272, 288, 291, 304, 
324, 325, 329, 333, 334, 338, 340, 342, 366, 372, 377, 384, 
396 

<223> n = A,T,C or G 
<400> 79 

caccaaccaa aacctggcgc cgttggcatc gtagagtgaa cacaacccaa aaacgatacg 60 
ccatctgttc tgccctggct gcctcagccc taccagcact ggtcatgtct aaaggncatc 120 
gtattgagga agttcctgaa cttcctttgg tangttgaag ataaagctga aggctacaag 180 
aagaccaang aagntgtttt gctccttaan aaacttanac gcctggaatg atatcaaaaa 240 
ngctatgcct ctcagcgaat gagactggan angcaaaatg agaaaccntc nccgcatcca 300 
gcgnaggggc cgtgcatctc tatnntgang atnntggnan cnttcaaggc cttcagaacc 360 
tccctngaaa tnctctnctt taangaacca aactgn 396 

<210> 80 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 312, 319, 353, 383 
<223> n « A,T,C or G 



<400> 80 
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tgtacatagg catcttattc actgcaccct 
tatttgaaag actgggaatt taatggttag 
cgactgtccc ctctaaagtt aaagtcaata 
aaaggctgtg aagaaaattc attttacatt 
ttgacatgag cacctttaga tccttccctt 
aggcctgtaa anggattgna atttcctata 
ttgagtaagc cctccaagat acnttcaata 

<210> 81 
<211> 396 
<212> DNA 

<213> Homo sapiens 



gtcacaccca gcaccccccg ccccgcacat 60 
ggacagtaaa tctacttctt tttccaggga 120 
caagaaaact gtctattttt agcctaaagt 180 
gggtagacag taaaaaacaa gtaaaataac 240 
catggggctt tgggcccaga atgacctttg 300 
agctgtatag tggagggatt ggngggtcat 360 
cctggg 396 



<220> 

<221> misc_f eature 

<222> 240, 286, 361, 364, 374, 375, 379, 380, 381, 387 
<223> n = A,T,C or G 

<4b0> 81 

gcagctgaag ttcagcaggt gctgaatcga ttctcctcgg cccctctcat tccacttcca 60 
acccctccca ttattccagt actacctcag caatttgtgc cccctacaaa tgttagagac 120 
tgtatacgcc ttcgaggtct tccctatgca gccacaattg aggacatcct gcatttcctg 180 
ggggagttcg ccacagatat tcgtactcat ggggttcaca tggttttgaa tcaccagggn 240 
ccgccatcag gagatgcctt tatccagatg aagtctgcgg acagancatt tatggctgca 300 
cagaagtggc ataaaaaaaa catgaaggac agatatgttg aagttttcag tgtcagctga 360 
nganagaaca ttgnngtann ngggggnact ttaaat 396 

<210> 82 

<211> 396 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> iaisc_feature 

<222> 220, 251, 297, 301,. 309, 349, 395 
<223> n = A,T,C or G 



<400> 82 

gactcagaaa 

agcagccgca 

catcccacat 

ttggccgttt 

aagtaggaaa 

ngaagtacnc 

atttccacca 



tgtcagtctc 
gccaagcaag 
aacttggggt 
catttgcttc 
ngtgcttttg 
tatgtgccac 
cggaatgacc 



atgaagttca 
taacttgtaa 
tagagcacaa 
caccagttct 
gagaaaatca 
ctaatcctca 
agtgctttgg 



aaagatcgag 
aatgaggaat 
gcgttcccag 
ggaaagagan 
cctgctcctc 
tatatgacct 
gtaana 



aatgtttgct 
gccatcaccc 
gaactactca 
ggcctagaag 
agaactgggc 
caagagacnc 



atcttggtgg 60 
ctcgagtgtc 120 
ccttaccatc 180 
ttcaaaaaaa 240 
ttacaanctg 300 
caataagcat 360 
396 



<210> 83 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 13, 372, 379, 393 
<223> n = A,T,C or G 



<400> 83 

tttgatttaa ganatttatt atttttttaa 



aaaaagcaac ttccagggtt gtcattgtac 



60 
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aggttttgcc cagtctccta tagcatggta tagtgataac tgatttttta taacaatgac 120 

tcagaggcat tgaagatcca taactatctt ctgaattatc acagaaagaa gaaagttaga 180 

agagtttaat gttaagtgta ttaaaaatca tattctaatt cttttaattt ggttatctga 240 

gtatgataat ataggagagc tcagataaca aggaaaaggc attggggtaa gaacactcct 300 

tcccacagga tggcattaac agactttttc tgcatatgct ttatatagtt gccaactaat 360 

tcacctttta cncagcttna ttttttttta ctnggg 396 

<210> -84 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 61, 232, 254, 270, 271, 286, 354, 356, 368, 374, 389, 394 
<223> n = A,T,C or G 

<400> 84 

tttttacagc aatttttttt tattgatgtt taacctgtat acaaccatac ccattttaag 60 
ngtacagaca aatgaatttt gacaaattca ttcactcatc taatcatcac tataaccatg 120 
atacagattt ttatcactcc aaaagtccat cctgtgctct tttcaagtcc atcctcctca 180 
tctgataccc caagccacca ttgttttgct ttctggaact acagttttgg gnttttagaa 240 
tttcatatat ggtngaatca taccatttgn natttggggc tgacgncttt cctccaataa 300 
tggatttgag aattatctac attttgcatg gatcctgggt tatttatacc aacnangggt 360 
tattatgnaa aatnggacca caatttggng gcanta . 396 

<210> 85 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 293, 305, 306, 317, 347, 357, 372, 377, 386, 391 
<223> n « A,T,C or G 



<400> 85 

cagtgaccgt gctcctaccc agctctgctc cacagcgccc acctgtctcc gcccctcggc 60 

ccctcgcccg gctttgccta accgccacga tgatgttctc gggcttcaac gcagactacg 120 

aggcgtcatc ctcccgctgc agcagcgcgt ccccggccgg ggatagcctc tcttactacc 180 

actcacccgc agactccttc tccagcatgg gctcgcctgc aacgcgcagg acttctgcac 240 

ggacctggcc gctccagtgc caacttcatt ccacggcact gcatctcgac canccggact 300 

tgcannggtt ggggaanccg cccttgtttc tccgtggccc atctaanacc aaacccntca 360 

ccttttcgga gnccccnccc ctccgntggg nttact 396 

<210> 86 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 5, 6, 28, 50, 58, 90, 108, 110, 118, 145, 154, 194, 244, 
285, 292, 300, 312, 315, 342, 344, 346, 359, 374, 378, 380, 
396 

<223> n « A,T,C or G 



<400> 86 
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ttttnnactg aatgtttaat acatttgnag gaacagaaga aatgcagtan ggattaanat 60 

tttataatta gacattaatg taacagatgn ttcatttttc aaagaagntn cccccttntc 120 

cctatctttt tttaatcttc cttanagcaa taantagtaa . ttactatatt tgtggacaag 180 

ctgctccact gtgntggaca gtaattatta aatctttatg tttcacatca ttattacctt 240 

ccanaattct accttcattt ccctgcacag gttcactgga ctggntcaca ancaaattgn 300 

actccactca antanaagag cccaaagaaa ttagagtaac gncnantcct atgaattana 360 

gacccaaaga tttnaggngn tgattagaaa cataan 396 

<210> 87 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 231, 277, 285, 296, 341, 351, 372, 377, 380 
<223> n = A,T,C or G 



<400> 87 

atggaggcgc tggggaagct gaagcagttc gatgcctacc ccaagacttt ggaggacttc 60 

cgggtcaaga cctgcggggg cgccaccgtg accattgtca gtggccttct catgctgcta 120 

ctgttcctgt ccgagctgca gtattacctc accacggagg tgcatcctga gctctacgtg 180 

gacaagtcgc ggggagataa actgaagatc aacatcgatg tactttttcc ncacatgcct 240 

tgtgcctatc tgagtattga tgccatggat gtggccngag aacancagct ggatgnggaa 300 

cacaacctgt ttaagccacc actagataaa gatgcatccc ngtgagctca nagctgagcg 360 

gcatgagctt gngaaantcn aggtgaccgg gtttga 396 

<210> 88 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 246, 266, 301, 328, 347, 349, 368, 370, 371, 374, 379, 387, 
391 

<223> n = A,T,C or G 



<400> 88 

tccagagcag agtcagccag catgaccgag cgccgcgtcc ccttctcgct cctgcggggc 60 

cccagctggg accccttccg cgactggtac ccgcatagcc gctcttcgac caggccttcg 120 

ggctgccccg gctgccggag gagtggtcgc agtggttagg cggcagcagc tggccaggct 180 

acgtgcgccc cctgcccccc gccgcatcga gagccccgca gtggccgcgc ccgctacagc 240 

cgcgcngctc agccggcaac tcacancggg gctcggagat ccgggacact gcggaccgct 300 

ngcgcgtgcc ctggatgtca ccactttngc ccggacaact gacggtnana caaggatggg 360 

gggtgganan nccngtaanc caagaanggg naggac 396 

<210> 89 

<211> 396 

<212> DIQA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 37, 76, 230, 295, 306, 333, 346, 370, 376, 377, 395 
<223> n A,T,C or G 



<400> 89 



wo 02/39885 



32 



PCTAJSOl/45395 



gagagaacag taaacatcca gccttagcat 
tatattcaca tggagnaatg ctattcaacc 
ctttgaccaa tgttcctaaa ttcactctgc 
atctttcctc cttttttcag tttctaactt 
atctcttttc ttttacattt tggggtgtcc 
ttcttnttct tttgaattgg ctgcccttta 
ctccctcctn tttccnntca ttctactctc 



ctctcangag tactgcagat cttcattagc 60 
tatttctctt atcaaaacta attttgtatt 120 
ttctctatct caatcttttt cccctttctc 180 
tcactggttc tttggaatgn tttttctttc 240 
cctctctttt cttaccctct ttctncatcc 300 
tcntctcatc tgctgncatc ttcatttctc 360 
tcccnt 396 



<210> 90 

<211> 396 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<222> 82, 110, 115, 120, 121, 125, 126, 129, 131, 140, 141, 144, 

145, 146, 148, 149, 150, 153, 154, 157, 158, 160, 161, 163, 

164, 166, 170, 172, 173, 174, 175, 179, 182, 184, 189, 193, 

194, 195, 200, 206, 213, 215, 217, 218, 219, 220, 227 
<223> n « A,T,C or G 



<221> misc_feature 

<222> 228, 231, 233, 236, 241, 247, 248, 249, 250, 254, 259, 262, 
269, 273, 274, 275, 280, 281, 282, 286, 287, 289, 293, 294, 
301, 302, 304, 309, 311, 318, 319, 324, 325, 330, 331, 333, 
334, 336, 337, 341, 342, 343, 344, 349, 352, 353, 358 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 361, 365, 367, 373, 377, 381, 385, 386, 387, 392 
<223> n « A,T,C or G 

<400> 90 

gggcgccggc gcgccccccc acccccgccc cacgtctcgt cgcgcgcgcg tccgctgggg 60 
gcggggagcg gtcgggccgg cngcggtcgg ccggcggcag ggtggtgcgn tttcnttttn 120 
nattnnccnc nttcttcttn nttnnncnnn ctnntanncn ntnncnttcn cnnnntttnc 180 
tntntcttna ccnnnttttn taatcntctt ctncntnnnn tctcttnnat ntnttnctta 240 
nttcctnnnn tttnttctnt cntttctcnc ctnnntctcn nnctcnncnc tcnncatttt 300 
nntnttttnt nccttctrint cttnnttctn ntnntnnttt nnnnttctnt tnntcatntt 360 
ncctntntta ctntcanctt ntatnnncct cntttt 396 



<210> 91 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 1, 3, 8, 9, 16, 17, 18, 21, 22, 32, 33, 45, 50, 63, 64, 68, 
75, 82, 92, 95, 98, 102, 106, 108, 110, 111, 116, 121, 135, 
151, 154, 158, 162, 167, 170, 176, 181, 185, 187, 209, 212, 
215, 225, 231, 245, 257, 278, 283, 288, 290, 292, 293 
<223> n = A,T,C or G 

<221> misc^feature 

<222> 312, 324, 326, 330, 331, 333, 334, 344, 345, 349, 351, 352, 
357, 358, 382, 384, 390, 392 
<223> n = A, T,C or G 
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<400> 91 

ntntcctnna tttttnnntc 
tcnncacnta aaacngcgga 
naaccccaaa aatancccaa 
ntttntnaac aaggggggtt 
ttganaaatt ttttaanagg 
aatttttttt cnttggggct 
tttttttttt tgccttcacc 



nncttttttt tnnaattttt 
anaggggatt tnttnttngg 
aatgcacagg nccngggnaa 
ttaaagggna tnggnatcaa 
cttgcccccc actttggncc 
cccngncccn nannttccgg 
cntnccattn cntttt 



ctttnttttn tttataaaaa 60 
gngtancncn nggccncaaa 120 
angaccnacn tgggtntttt 180 
agggnataaa ntttaaacct 240 
ccnccccncn gnngggatcc 300 
gttnntggnc nntcctnntt 360 

396 



<210> 92 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 3, 1, 8, 9, 11, 31, 149, 152, 221, 233, 259, 263, 264, 265, 
266, 274, 278, 279, 283, 286, 294, 302, 307, 309, 310, 311, 
314, 316, 320, 343, 351, 363, 372, 377, 386, 393 
<223> n = A,T,C or G 

<400> 92 

ctntttnnnt ntttttttcc ccatcatcca naaatgggtt ttattctcag ccgagggaca 60 

gcaggactgg taaaaactgt caggccacac ggttgcctgc acagcacccc catgcttggt 120 

agggggtggg agggatggcg ggggctggnt gnccacaggc cgggcatgac aaggaggctc 180 

actggaggtg gcacactttg gagtgggatg tcgggggaca ncttctttgg tanttgggcc 240 

acaagattcc caaggatanc acnnnnactg attnccannc tanagncaag cggntggcca 300 

tntgtangnn nttntntatn tgactattta tagattttta tanaacaggg naagggcata 360 

ccncaaaagg gnccaanttt ttaccnccgg gcnccc 396 

<210> 93 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 290, 304, 313, 320, 325, 333, 337, 348, 351* 
<223> n = A,T,C or G 

<400> 93 

gctgccacag atctgttcct ttgtccgttt ttgggatcca caggccctat gtatttgaag 60 
ggaaatgtgt atggctcaga tcctttttga aacatatcat acaggttgca gtcctgaccc 120 
aagaacagtt ttaatggacc actatgagcc cagttacata aagaaaaagg agtgctaccc 180 
atgttctcat ccttcagaag aatcctgcga acggagcttc agtaatatat cgtggcttca 240 
catgtgagga agctacttaa cactagttac tctcacaatg aaggacctgn aatgaaaaat 300 
ctgnttctaa ccnagtcctn tttanatttt agngcanatc cagaccancg ncggtgctcg 360 
agtaattctt tcatgggacc tttggaaaac tttcag 396 

<210> 94 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 115, 204, 205, 243, 266, 276, 316, 319, 355, 357, 364 
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<223> n = A,T,C or G 



<400> 94 

tgccttaacc agtctctcaa gtgatgagac agtgaagtaa aattgagtgc actaaacgaa 60 
taagattctg aggaagtctt atcttctgca gtgagtatgg cccaatgctt tctgnggcta 120 
aacagatgta atgggaagaa ataaaagcct acgtgttggt aaatccaaca gcaagggaga 180 
tttttgaatc ataataactc atanngtgct atctgtcagt gatgccctca gagctcttgc 240 
tgntagctgg cagctgacgc ttctangata gttagnttgg aaatggtctt cataataact 300 
acacaaggaa agtcanccnc cgggcttatg aggaattgga cttaataaat ttagngngct 360 
tccnacctaa aatatatctt ttggaagtaa aattta 396 

<210> 95 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> roisc_f eature 

<222> 11, 16, 31, 36, 42, 49, 53, 56, 57, 60, 67, 70, 84, 89, 91, 
92, 99, 105, 106, 112, 120, 121, 125, 127, 128, 133, 137, 
141, 151, 152, 153, 154, 155, 162, 166, 167, 168, 174, 177, 
179, 186, 188, 194, 195, 199, 203, 205, 213, 217, 221 
<223> n = A,T,C or G 

<221> misc^feature 

<222> 227, 232, 235, 236, 240, 242, 260, 261, 265, 266, 291, 297, 
318, 325, 330, 339, 348, 351, 352, 354, 356, 362, 364, 372, 
380, 392, 395, 396 
<223> n «» A,T,C or G 



<400> 95 

cctcccaccc ncttanttca tgagattcga 
tattctnacn atttctttct tggngcggna 
ncttntnntt tcntggngct ntcccttttc 
ttctgnangg gggnntccna aananttttt 
cnaattgttt cccccccccn ntagnntatt 
attttcccta aaatttcncc tccanatttn 
tntnaatttt tnaaaaaaan ttattttcat 



naatgncact tntgtgctnt ttnctnnttn 60 
nnaatcccnt ttttnngggc gnctctcccn 120 
nnhnnaaact tntacnnngt ttanaantnt 180 
ccncctncct nattccnctc tnaannctcn 240 
ttttctaaaa aattaactcc nacgganaaa 300 
gaaaaaacnc gcccgganct nntntncgaa 360 
cnggnn 396 



<210> 96 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<222> 161, 193, 253, 259, 281, 288, 299, 309, 318, 319, 335, 340, 
344, 352, 355, 356, 387, 396 
<223> n » A,T,C or G 



<400> 96 

cctgggtacc aaatttcttt atttgaagga 
ttttggacaa cttatagaaa aggtaaagga 
ccagggaagt caccccacgg ctatggggaa 
tgcttccaag ggngtgcttg gcaaaaaaat 
tgcccagttg gtnccgggnc cccaattctt 
ctcatgaant cccccaanng gggcattttg 
tgatccattt tttccaatgt aatgccnctt 



atggtacaaa tcaaagaact taagtggatg 60 
aaccccaaca tgcatgcact gccttggcga 120 
attagcccga ngcttaactt tcattatcac 180 
attccgccaa ccaaatcggg cgctccatct 240 
ggatgctttc ncctcttntt ccggaatgng 300 
ccagnggccn tttngccatt cnagnnggcc 360 
cattgn 396 
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<210> 97 

<211> 396 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 13, 15, 16, 19, 23, 31, 38, 39, 41, 45, 68, 94, 95, 100, 
119, 131, 133, 141, 144, 164, 171, 182, 186, 190, 191, 195, 
196, 198, 213, 229, 231, 235, 239, 247, 257, 265, 269, 272, 
278, 279, 286, 289, 291, 306, 309, 310, 312, 317, 320 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 321, 327, 328, 337, 340, 343, 351, 360, 361, 368, 375, 381, 
385, 386, 387, 388 
<223> n = A,T,C or G 

<400> 97 

ctcaccctcc tcntnnttnt canaatattg ngaacttnnt nctgntcgaa tcactggcat 60 
taaagganca ctagctaatg gcactaaatt tacnnactan ggaaactttt ttataatant 120 
gcaaaaacat ntnaaaaaga ntgnagttcg cccatttctg cttnggaaga nctcttcact 180 
tntaancccn natgnngncc tttgggtcaa aanctccgcg attattacng ngttncccnc 240 
tatttgncct tcctttntcc ccaangccnc anatttcnna actttnccnt naaatgcctt 300 
tatttnatnn cntttcnacn ncttaanntt ccctttnaan aangatccct ncttcaaatn 360 
ntttcccngt tcctngcatt ncccnnnnat ttctct 396 

<210> 98 
<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 130, 202, 285, 296, 299, 308, 314, 321, 322, 336, 373 
<223> n *= A,T,C or G 



<400> 98 

acagggacaa tgaagccttt gaagtgccag tctatgaaga ggccgtggtg ggactagaat 60 
cccagtgccg cccccaagag ttggaccaac caccccctac agcactgttg tgataccccc 120 
agcacctgan gaggaacaac ctaccatcca gaggggccag gaaaagccaa actggaacag 180 
aggcgaatgg ctcagagggg tncatggcca agaaggaagc cctggaagaa cttcaatcac 240 
cttcggtttc gggaccaccg gcttgtgtcc ctgttctgac tgcanaactt ggcgcngtnc 300 
cccattanaa cctntgactc nncccttgct ataagnctgt tttggcccct gatgatgata 360 
gggtttttat gangacactt gggcaccccc ttaatg 396 

<210> 99 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<226> 

<221>. misc_f eature 

<222> 1, 4, 13, 15, 26, 31, 43, 46, 48, 52, 54, 55, 60, 62, 68, 
72, 93, 112, 118, 119, 122, 131, 132, 133, 134, 145, 147, 
152, 157, 163, 164, 186, 190, 225, 231, 239, 246, 247, 250, 
255, 262, 285, 314, 316, 319, 325, 332, 339, 343, 345 
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<223> n = A,T,C or G 



<221> misc_feature 

<222> 348, 351, 352, 355, 357, 361, 370, 387 
<223> n « A,T,C or G 



<400> 99 

nttntttttc cgncnaaagg gcaagngttt ncatctttcc tgnccncnca ananngggtn 60 

tntgtgcntt tnttttttcc caaaacccgg gtnggggaca ccttttgagg anccactnnt 120 

cntccggggc nnnnttttag aaggngncta anaagcntct tgnnggggga aaaacatctt 180 

tttgcncccn acataccccc aagggggggg ggtgtctggg agganactaa ngacttttnt 240 

tttttnnccn caaanaactg anggccccca ttgctccccc cccantcttt aaaaaacccc 300 

ttcaatttcc ttgncnggna aaaanggttg gnaaaaaang agngngcntc nnttncnttt 360 

natggaaggn aaaaggtttt tggttgnaaa accccg 396 

<210> 100 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 229, 286, 303, 312, 334, 335, 348, 350, 357, 364, 371, 395 
<223> n = A,T,C or G 



<400> 100 

ctaacacggt gaaaccctgt 
gggcacctgt agtcccagct 
ggcggagctt gcagtgagct 
gactcccgct caaaaaaaaa 
gggccctatc ccctccttgg 
tgngattttg gnaacatatg 
cctntgactg nggaaaaatc 

<210> 101 
<211> 396 
<212> DNA 
<213> Homo sapiens 



ctctactaaa aatacaaaaa 
gctcaggaag ctgaggcagg 
gagatcgtgt cagtgcactc 
aaaaaaaaga gaaaagaaaa 
ggatcaatga gacccctttt 
gcactggtgc ttcnnggaat 
cagcaggagg cccana 



aattagccag gcgtggtggc 60 
agaatggcgt gaacccagaa 120 

cagcctgggc gacagagcga 180 
agctgcagng agctgggaat 240 
caaaanaaaa aaaaaaataa 300 
tctgtttntn ggcatgnccc 360 

396 



<220> 

<221> misc_feature 

<222> 93, 99, 100, 111, 168, 172, 174, 199, 209, 216, 218, 219, 
227, 242, 243, 269, 272, 297, 300, 301, 308, 315, 317, 323, ' 
331, 341, 344, 348, 357, 359, 363, 364, 366, 376, 379, 386, 
389, 392 

<223> n = A,T,C or G 
<400> 101 

agttataact caacagttca tttatatgct gttcatttaa cagttcattt aaacagttca 60 
ttataactgt ttaaaaatat atatgcttat agncaaaann tgttgtggcg nagttgttgc 120 
cgcttatagc tgagcattat ttcttaaatt cttgaatgtt cttttggngg gntnctaaaa 180 
ccgtatatga tccattttna tgggaaacng aattcntnnc attatcncac cttggaaata 240 
cnnaacgtgg gggaaaaaaa tcattcccnc cntccaaaac tatacttctt ttatctngan 300 
nttcttgntc ctgcncnggt ttngaatata nctgggcaaa nggntttncc aaatccntnt 360 
acnntncttt gggaantanc ggcaantcnt cncttt 396 



<210> 102 
<211> 396 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 17, 93, 136, 183, 317 

<223> n = A,T,C or G 



<400> 102 

actatacata 

cgggtatgga 

ccctgccact 

ctnctcccat 

ctggctctcc 

ctcaaacaca 

tatttgggcc 



agaacangct 
caggtcataa 
tggagnggac 
ctcctacgga 
agcctgctta 
gtaatgnggc 
cgatctgggg 



cacatgggag 
acctagagtc 
ccctctactc 
ctggcgccaa 
ctcctctatt 
actgacccta 
ggggcaaaat 



gctggaggtg 
agngtcctgt 
ttcttagcgc 
atggctttcc 
tttaaagggc 
ccacacctca 
attttg 



ggtacccagc 
tggcctagcc 
ctaccctcat 
tgccaatttt 
caaacaaatc 
tgaagggggc 



tgctgtggaa 60 
catttcagca 120 
acctatctcc 180 
gggatcttct 240 
ccttctcttt 300 
ttgttgcttt 360 
396 



<210> 103 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 91, 174, 176, 188, 201, 214, 254, 277, 299, 325, 349, 355, 

365, 372, 390 

<223> n = A,r,C or G 



<400> 103 

ttgtgttggg 

cccaacttca 

actacaaagg 

cccgaganca 

aatggggctt 

actgggaccc 

taaangctga 



actgctgata 
aagccacagc 
aaaatatggt 
ttgctttccg 
ctgnggatct 
ataacattct 
tnaagcatct 



ggaagatgtc 
tggtatgcca 
ggggtcttct 
ngatagggca 
ccttctggca 
tttgnatcaa 
cgtccgggcn 



ttcaggaaat 
natggtcagg 
tttaccctct 
aaanaaatta 
ttacatnggc 
ccgaagcccc 
ttttat 



gctaaaattg 
ttaaagatat 
tgacttccct 
aaaaacttaa 
aatccctaaa 
cattgttang 



ggcaccctgc 60 
caacctgctg 120 
ttgngngccc 180 
ctggccagtg 240 
aaacaagang 300 
atatngggct 360 
396 



<210> 104 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 32, 53, 86, 141, 154, 156, 181, 182, 197, 204, 219, 224, 
226, 229, 232, 245, 253, 260, 262, 271, 273, 276, 292, 301, 
303, 305, 321, 325, 332, 343, 352, 382, 392 
<223> n = A,T,.C or G 



<400> 104 

aagggagggc gcgccaagac cttcccactc 

cccagtccaa cttggatacc cttggnttta 

gcaacttgtc aagttctcaa nactgtctct 

nncccccgac gtatttntca aaangtctgc 

ttacnaggtc atnaatttcn cntcaactct 

ngncnccaat tctgtatttt nctcntcaac 

tctccccttc cttattccgg cnttgtttgc 



gngcacactg ggggcgccga cangacgcaa 60 
gttctcggac acttctttta tctctccgtc 120 
ctgngntatc ttttttcttc gctgctcttc 180 
aattgttgna tacntnganc tncaccactg 240 
ntnccncttg ttccctgata tntcggccgg 300 
gntctcactt ttncctcctc cnggccactt 360 
cnccat 396 
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<210> 105 
<211> 396 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> inisc_feature 

<222> 57, 306, 356, 388, 391 

<223> n = A,T,C or G 



<400> 105 

tcaatagcca gccagtgttc atttttatcc 
atttttagtc attgggtcat acagcactaa 
gtttttaatc caggccaaca tgtatgtaaa 
gtactacttg agatttgatt atgagatgtg 
aaataattct ctcctttggt ttgaacctca 
acaagnattt tcattccaca aaacctgagg 
ccccacccca cttgccttgg gtctttanaa 



ttgagctttt agtaaaaact tcctggnttt 60 
agtctgctat ttatggaaac taactttttt 120 
ttaaattttt agataattga ttatctcttt 180 
catattgctt tgggaagagc tcgaggaagg 240 
actagataaa ccctaggaat tgttaactgc 300 
cagctctttt gccagagcgt tcctgnaccc 360 
ngagcc 396 



<210> 106 
<211> 396 

<212> DNA 

<213> Homo sapiens 

<400> 106 

gctgtgtagc acactgagtg acgcaatcaa tgtttactcg aacagaatgc atttcttcac 60 
tccgaagcca aatgacaaat aaagtccaaa ggcattttct cctgtgctga ccaaccaaat 120 
aatatgtata gacacacaca catatgcaca cacacacaca cacacccaca gagagagagc 180 

tgcaagagca tggaattcat gtgtttaaag ataatccttt ccatgtgaag tttaaaatta 240 
ctatatattt gctgatggct agattgagag aataaaagac agtaaccttt ctcttcaaag 300 
ataaaatgaa aagcaattgc tcttttcttc ctaaaaaatg caaaagattt acattgctgc 360 
caaatcattt caactgaaaa gaacagtatt gctttg 396 

<210> 107 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 12, 210, 257, 261, 271, 302, 311, 314, 318, 368, 374, 385, 
389, 396 

<223> n = A,T,C or G 



<400> 107 

ttcacagaac anggtggttt attatttcaa 
taaaggagca gaacctgacc cagagcctgc 
tgggccaggc agggccaaag gcagcagaaa 
ctctgctggg tgtgggcagg tgggcatgan 
ggcaaaacct gtcaggnttg ntgaatgtca 
anaaggccgg nggncccncc cttgcccgta 
gttgtttncc ccanggagct gccanggana 

<210> 108 
<211> 396 
<212> DNA 
<213> Homo sapiens 



tagcaaagag ctgaaaaatg tcgggtccca 60 
agtacatttc caccccacag gggtgcaggc 120 
tgggagtaag agactgtgcc cactgagaag 180 
atgatgatga tgtagtgtaa ggaccaggta 240 
nagtggatcc aaaaggctga gggggtcgtc 300 
tgggccttca aaaagtatgc ttgctcatcc 360 
aggctn 396 
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<220> 

<221> misc^feature 

<222> 280, 281, 286, 305, 311, 313, 323, 326, 327, 340, 352, 356, 
363, 369, 378, 388, 392 
<223> n = A,T,C or G 



<400> 108 

gcctgctttt 

caggtgtgca 

agaaaatgtg 

ggatctcttt 

tgaaatagct 

tttgnaactg 

tgncctggnc 



gatgatgtct 
cagaaaaccg 
gccctaaagg 
ttatttaaat 
t'tactgcttc 
ncnttgattt 
ttttgaanga 



acagaaaatg 
agaatattca 
gggttagttg 
gtgaatttca 
tcacgtgttt 
tcngcnncca 
atacatgngt 



ctggctgagc 
aaattccaaa 
aggggtaggg 
acttttgaca 
tggagaaaan 
agctatatcn 
gntgct 



tgaacacatt 
tttttttctt 
ggtagtgagg 
atcaaagaaa 
natcanccct 
aatatcgtct 



tgcccaattc 60 
aggagcaaga 120 
atcttgattt 180 
agacttttgt 240 
gcaatcactt 300 
gngtanaaaa 360 
396 



<210> 109 
<211> 396 
<212> DNA 

. <213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 237, 279, 284, 291, 305, 307, 
376, 392, 394, 395 
<223> n = A,T,C or G 



308, 313, 326, 343, 351, 366, 



<400> 109 

ggccgtaggc agccatggcg cccagcccgg aatggcatgg tcttgaagcc ccacttccac 60 
aaggactggc agcggcgcgt ggccacgtgg ttcaaccagc cggcccggaa gatccgcaga 120 
cgtaaggccc ggcaagccaa ggcgcgccgc atcgctccgc gccccgcgtc gggtcccatc 180 
cggcccatcg tgcgctgccc acggttcggt accacacgaa gggcgcgccg gcgcggnttc 240 
agcctggagg agctcagggt ggccggattt acaagaagng gccngacatc ngtattcttg 300 
ggatncnnga agnggaacaa gtcacngagt ccttgcagcc acntcagcgg ntgatgacac 360 
cgttcnaact catctnttcc caagaaacct cngnnc 396 

<210> 110 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 1, 2, 12, 13, 16, 18, 29, 39, 60, 66, 70, 86, 90, 104, 121, 
122, 127, 128, 146, 165, 171, 172, 173, 176, 188, 189, 193, 
195, 205, 210, 211, 224, 226, 227, 231, 233, 240, 243, 244v 
248, 249, 255, 257, 258, 260, 266, 268, 272, 273, 275 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 278, 280, 287, 292, 294, 303, 308, 312, 315, 320, 322, 332, 
333, 334, 335, 345, 347, 351, 363, 364, 369, 371, 372, 379, 
381, 382, 386, 391, 393 
<223> n = A,T,C or G 



<400> 110 

nntgggctcc tnncantnat aataaaccng 
tatgtncatn ctgacgcgga aacagngcan 
nnggccnngg tgggcgcatt cccggnggag 



actcatacnc cacaaggaga tgaacaggan 60 
ggagctgagg aggngccaag atgagaccta 120 
ggggccacta aggantacga nnntcnagcg 180 
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gctcttgnng gcngncctcc tcacncctgn ntattcgatt gtcncnnatg ncntcctatn 240 
atnntcanna ttctntnntn atctcntnta cnncntcncn ttcatgntta cngntccctc 300 
tcnttctnac cnttntctgn anctcctttc tnnnnctttc atctntnttc ngctttcttt 360 
ctnnaatcnt nntttaacnt nntctncttt ntnatt 396 

<210> 111 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 4, 1, 11, 16, 19, 25, 26, 30, 33, 39, 54, 60, 69, 75, 81, 
99, 102, 130, 132, 143, 154, 156, 166, 180, 182, 188, 190, 
192, 194, 198, 201, 226, 242, 253, 261, 264, 295, 305, 313, 
315, 320, 323, 325, 330, 334, 337, 340, 344, 348, 349 
<223> n = A,T,C or G 

<221> mis cofeature 

<222> 351, 352, 357, 358, 359, 361, 362, 381, 387, 388, 389, 394 
<223> n » A,T,C or G 

<400> 111 

taangancat nctggnttnt gcctnnccgn ctnattgant gttaaaggca attntgtggn 60 

tgtcccagng aatgncggct nattttcttt ccacattgng cncattcact cctcccactc 120 

ttggcatgtn gngacataag canggtacat aatngnaaaa atctgnattt ctgatgccan 180 

angggtanan cntnttgnat ntcattccat tgatatacag ccactntttt atttttgatc 240 

ancggccttc ggntcactgc ncanggtact tgacctcagt gtcactatta tgggntttgg 300 

tttcnctctt ttncnggccn ttntntttcn cacnttncan cttncttnnt nnaaaannna 360 

nncactctct cttgctctct ngatacnnng tctnaa 396 

<210> 112 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 172, 186, 378, 380, 382, 388 

<223> n = A,T,C or G 

<400> 112 

tcaacgtcac caattactgc catttagccc acgagctgcg tctcagctgc atggagagga 60 
aaaaggtcca gattcgaagc atggatccct ccgccttggc aagcgaccga tttaacctca 120 
tactggcaga taccaacagt gaccggctct tcacagtgaa cgatgttaaa gntggaggct 180 
ccaagnatgg tatcatcaac ctgcaaagtc tgaagacccc tacgctcaag gtgttcatgc 240 
acgaaaacct ctacttcacc aaccggaagg tgaattcggg gggctgggcc tcgctgaatc 300 • 
acttggattc cacattctgc tatgcctcat gggactcgca gaacttcagg ctggccaccc 360 
tgctcccacc atcactgntn gncaatantc acccag 396 

<210> 113 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1, 2, 3, 4, 7, 8, 9, 10, 11, 65, 273, 279, 280, 289, 321, 
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338, 380 

<223> n « ArT,C or G 



<400> 113 

nnnnttnnnn nggagcctta atttcagagt 
atggntatac aattttctct cattcagttt 
gaataccttt accaccagat tagaacagta 
gtcttacaaa taaaaacaca tttaaaatag 
catatatttt atatcatggt tacttatgct 
gatggaaata tgggaaaact ntattcatgc 
aatcccgttt ttattttggn aatgattcat 



tttattgtat tgcactaaag gaacagcagg 60 
tgaaaatctg tagtacctgc aaattcttaa 120 
agcataataa ccaatttctt aataagtaat 180 
ctttaaatgc attcttcaca agtaattcag 240 
tangaattnn agcaggatnt ttattctttt 300 
atatacangg ataatattca gcgaagggaa 360 
atataa 396 



<210> 114 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<222> 40, 82, 114, 116, 146, 164, 166, 174, 185, 212, 215, 219, 
224, 236, 242, 254, 258, 263, 270, 286, 299, 308, 327, 328, 
329, 345, 363, 378, 382, 385 
<223> n = A,T,C or G 



<400> 114 

aaatgggaca 

acaagcattt 

gacgccacgg 

tcgcnatgtg 

anaactacaa 

tcttttcntc 

tgnacttctg 



acgtgattct 
ggactctaag 
ttccctccca 
ccccttatct 
ctgnaatntc 
ccctggagtg 
aacaaganca 



tttgttttaa 
gnttagaact 
ccccgngatc 
atagccttcc 
tcnaacggtn 
tgagtgnnng 
anacnatggc 



ataaatactn 
ggagagtctt 
aagacacgga 

cnggncatnt 
atggtccGca 
aggaagaagc 
cccccc 



agaacacgga 
acccatgggc 
atcngntggc 
acangcagga 
ccgatnaaga 
ccttncctta 



cttggctcct 60 
cccncncagg 120 
gatngttgga 180 

tgcggntggg 240 
ttctacctng 300 
catcaccttt 360 
396 



<210> 115 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 277, 297, 321, 341, 381, 391 

<223> n = A,T,C or G 



<400> 115 

ccgcctggtt cggcccgcct gcctccactc ctgcctctac catgtccatc agggtgaccc 60 
agaagtccta caaggtgtcc acctctggcc cccgggcctt cagcagccgc tcctacacga 120 
gtgggcccgg ttcccgcatc agctcctcga gcttctcccg agtgggcagc agcaactttc 180 
gcggtggcct ggcggcggct atggtggggc cagcggcatg ggaggcatca cccgcagtta 240 
cggcaaccag agcctgctga gccccttgcc tggaggngga ccccaacatc aagccgngcg 300 
cacccaggaa aaggagcaga ncaagaccct caacaacaag nttgcttctt catagacaag 360 
ggaccggtcc ttgaacagca naacaagatg ntggag 396 

<210> 116 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<220> 
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<221> misc^feature 

<222> 267, 290, 343, 351, 376 

<223> n « A,T,C or G 



<400> 116 

atctcagttt actagctaag tgactttggg caagggattt aacctctcgt ccctcagttt 60 
cctcctatgt aaaatgacaa ggataatagt accaacccaa tgtagattaa atgagtttac 120 
gaagtgttag aatagtgctt ggcacattag tgctttacaa ctgctatttt gattgttgtt 180 
gtgggctctc tcaaatgcat tgtctctaga tgccagtgac ccaggtcaaa atttaccttt 240 
aaccaagctg catgtttccc agactgntgc acagtcctct accctgagan aaagcttcca 300 
cccaaggata cttttacttt ctgctggaaa act gat gage aanggcaaca ngggacactt 360 
atcgccaact ggaaangaga aattcttcct tttgct 396 

<210> 117 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 228, 267, 318, 331, 357, 368, 376 
<223> n = A,T,C or G 



<400> 117 
aaacattttt 
tttcccatat 
tgctggatgt 
tattgcacat 
aacttaaaga 
\ aatcacatta 
tgaattcnga 



taataaaatt 
caccgaggat 
tgccaggact 
attaacatta 
tttccgtggg 
ttacaagntt 
cttggnttta 



cctatagaaa 
tgagagctcc 
cagagagatc 
cttgccccta 
gcgcganacc 
taataaatac 
ttccataaat 



gctcagtcat 
caatattctt 

acccatttac 
gcaccctaaa 
atttcaattt 
nggaagaaga 
acggtt 



agggcaaata 
tggagaataa 

acattcaaac 
tatatggnac 
gaactaatat 
gctggcattt 



ctcagttctc 60 
gcagtagttt 120 

cagtagttcc 180 
ctcaacaaat 240 
ccttgaaaaa 300 
ttctaanatc 360 
396 



<210> 118 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> ndsc^feature 

<222> 4, 5, 12, 14, 15, 16, 24, 59, 80, 87, 225, 280, 286, 287, 
295, 297, 298, 337, 349, 362, 375, 387, 394 
<223> n = A,T,C or G 



<400> 118 

accnncacct 

cctaacactc 

aaatctttat 

ttggaagaaa 

ccaacaagag 

tttaaaaatg 

angaatcccc 



gntnnntttt 
tctttactgn 
ttccaaaatg 
gagacttgat 
catggtaccc 
accccactgg 
cttanggact 



aacnattaca 
ctcaagngga 
atttgttagc 
gcttcttata 
attcttacca 
gcccaatcca 
ttatctnggc 



acttctttat 
agtccaaaca 
caaaagaact 

aattccccat 
ttaacctggn 
acatganacc 
tganaa 



<210> 119 
<211> 396 
<212> DNA 
<213> Homo sapiens 



atggcagttt 
aatttcattt 
ataaaccacc 
tgcanacaaa 
tttaannctc 
taggggggnt 



ttactgggng 60 
ttgtagtaaa 120 
taacaagact 180 
aaataacaat 240 
caaancnnga 300 
tgccttgatt 360 
396 



<220> 
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<221> misc_feature 

<222> 251, 281, 298, 301, 308, 326, 332, 337, 351, 358, 362, 388, 
394 

<223> n = A,T,C or G 



<400> 119 

atggccagct 

tcctgcagaa 

gacccacaat 

tacagctgga 

tgaagcccaa 

ntactcanga 

angcaggaat 



cactttaaat 
ggttgtggga 
ggagagaagc 
aacctgaaga 
ngatgtggcc 
gagaaagcct 
gtgcatatta 



accacctcaa 

aaagcttcta 
cctttatgtg 
accaccggcg 
gtcctttgct 
cattantgcc 
aaaagctncc 



gactcatcga 
tgtgctgcag 
ccatgagtct 
catccacaca 
gagtattcta 
antctgnggg 
ttgnac 



aatgaccgct 
aggctgaagg 
ggctgtggta 
ggagagaaac 
ncttcgaaaa 
aaaaccttct 



ccttcatctg 60 
tgcacatgag 120 
agcagtttac 180 
ctttcctttg 240 
catctggngg 300 
ntcagagngg 360 
396 



<210> 120 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 261, 263, 265, 272, 273, 288, 308, 310, 330, 379 
<223> n = A,T,C or G 



<400> 120 
catgggtcag 
tgaaatgtag 
tacattatgc 

acagactgct 
ttattgttaa 
atctctgngn 
atctagatgt 



tcggtcctga 
aagactggaa 
ttggcatgaa 

gttagaaggt 
ttgagttaca 
agaacatttg 
attgtaccna 



gagttcgaag 
aattaagaca 
taaggtgcca 
atgggatcat 
nangncattc 
attccattgn 
aaggggaaaa 



agggcacatt 

ttatgtaaag 
ggaaaacagt 
attaagataa 
annactgagt 
gaagaatgca 
ataaca 



cccaaagaca 
gtagatatgg 
ttaaaattat 
tctgcagctc 
ttatagancc 
gtttaaaata 



ttcccagtca 60 
cttttagagt 120 
acatcagcat 180 
tactacgcat 240 
atattgctct 300 
tctgaatgcc 360 
396 



<210> 121 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 77, 125, 130, 142, 155, 162, 166, 176, 204, 227, 242, 243, 
245, 246, 249, 251, 252, 265, 279, 306, 310, 314, 336, 341, 
354, 367, 382, 385, 390, 395 
<223> n = A,T,C or G 



<400> 121 

tttttttttt 

aaaagggtta 

caagncatan 

ataactgtaa 

tnntnncana 

ttctcngggn 

aaggccngat 



ttttttttaa 
ataaacnttt 
aaaccagatt 

tccccaaggg 
nncagactct 
tgcncatggg 
gcagggtcaa 



aatcaagtta 
actacatggc 
cnaatgccct 

gaanagggtt 
taccntcttt 
aagatggctt 
anagnttccn 



tgtttaataa 
aaattatttt 
taaanaattt 

gggtatgaca 
ctgctgtgnc 
tggacntaac 
ccatnt 



acattaataa 
agctagaatg 
tnaaanatcc 
ggtacanggg 
accctcaggc 
nacacccttt 



atgtttactt 60 
cttttggctt 120 
attgangggg 180 
gccagcccag 24 0 
attggctcca 300 
tgtncacgta 360 
396 



<210> 122 
<211> 396 
<212> DNA 
<213> Homo sapiens 
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<400> 122 

gtcgacatgg ctgccctctg ggctcccaga 
agctcaagcc tgggcctttg aatccggaca 
tgcactttac aaccactgca. ctacctgact 
ggaaccagac ctcatcagcc caacatcaaa 
cccaccccgc accggcgact ccatcttcat 
cagccaccac atcatcccag agctgagctc 
tccctcttct tctttttcat ccttctgtct 

<210> 123 
<211> 396 
<212> DNA 
<213> Homo sapiens 



acccacaaca tgaaagaaat ggtgctaccc 60 
caaaaccctc tagcttggaa atgaatatgc 120 
caggaatcgg ctctggaagg tgaagctaga 180 
gacaccatcg gaacagcagc gcccgcagca 240 
ggccaccccc tgcggtggac ggttgaccac 300 
ctccagcggg atgacgccgt ccccaccacc 360 
ctttgt 396 



<220> 

<221> misc_feature 

<222> 74, 94, 142, 149, 194, 219, 233, 279, 316, 335, 368 
<223> n = A,T,C or G 



<400> 123 

gccctttttt tttttttttt tttcctagtg 
catacacagc acanaggcac gggcaccatg 
gatcttggcc tcacggtgta anaagggana 
ctagggaacc cagnagcaaa tcccaccacg 
cttggtgacg tttagttcca accattatag 
ccattacagg gtgaanatat aaacagtaaa 
aggagcanat gacaccatca aaagcatatg 

<210> 124 
<211> 396 
<212> DNA 
<213> Homo sapiens 



ccaggtttat tccctcacat gggtggttca 60 
gganagggca gcactcctgc cttctgaggg 120 
ggatggtttc tcttctgccc tcactagggc 180 
ccttccatnt ctcagccaag ganaagccac 240 
taagtggana agggattggc ctggtcccaa 300 
ggaanataca gtttggatga ggccacagga 360 
caggga 396 



<400> 124 

gaccattgcc ccagacctgg aagatataac 
ttcctccctt acagagcata caacagaggg 
tccaatttca cgcttttaat tctcatttgt 
taatctccat ttcaaaacca aggaagcagc 
aggctgagtc cagagcttgt gctcctcttg 
tcttgcctgt ctggctcagg gtcaaagaca 
tcaggctact cattcagtcc caaatatgta 

<210> 125 
<211> 396 
<212> DNA 
<213> Homo sapiens 



attcagttcc caccatctga ttaaaacaac 60 
ggcacccggg gaggagagca catactgtgt 120 
tctcacacca acagtgtgaa gtgcgtggta 180 
ctcagagtgg tcgagtgaca cacctcacgc 240 
attcctggtt tgactcagtt ccaggcctga 300 
gaatggtgga gtgtagcctc cacctgatat 360 
ttttcc 396 



<220> 

<221> misc_f eature 

<222> 43, 88, 91, 94, 139, 141, 150, 163, 193, 202, 212, 215, 222, 
238, 253, 256, 286, 297, 331, 343, 350, 360, 376, 385, 396 
<223> n = A,T,C or G 



<400> 125 

cccttttttt tttttttttt tttttttttt 

attaagtcaa attaaaaaac ttcatgcncc 

agttatactt agcttgagng naaaacttgn 



ttttttactt tgnaacaaaa atttattagg 60 
nccncttgtc atatttacct gaaatgacaa 120 
gccccaaaaa ttntgtttgg aaagcaaaaa 180 
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aataattgat gcncatagca gngggcctga tnccnccaca gngaatgttg tttaaggnct 240 

aacaaacagg ggncancaaa gcatacatta cttttaagct ttgggnccaa ggaaaangtc 300 

attccctacc tccttcaaaa gcaaactcat natagcctgg gcncctaggn ctggagcctn 360 

ttttttcgag tctaanatga acatntggat ttcaan 396 

<210> 126 
<211> 396 

<212> DNA 

<213> Homo sapiens 



<400> 126 

cgcgtcgact 

caacaaaggg 

aaggggccag 

gtgctccctc 

gccccacgac 

ggtcctagac 

acctgttctc 



cgcaagtgga 
cacgaaatga 
ctagacaaag 
agccccgagg 
ctggacacgc 
ctcagcatgc 
accgtcctgg 



atgtgacgtc 
gtcctcaggt 
acaccctaga 
agctgagctc 
tggggctacg 
aagaggccct 
cactgctcct 



cctggagacc 
ggccaccctg 
caccctgacc 
cgtgcccccc 
gctacagggc 
ctcggggacg 
agcctc 



ctgaaggctt 
atcgaccgct 
gccttctacc 
agcagcatct 
ggcatcccca 
ccctgcctcc 



tgcttgaagt 60 
ttgtgaaggg 120 
ctgggtacct 180 
gggcggtcag 240 
acggctacct 300 
taggacctgg 360 
396 



<210> 127 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 15 

<223> n = A/T,C or G 



<400> 127 

tttttttttt 

caatgaatac 

aaatatcctt 

gctaatgaat 

gcatgatcag 

tcacataaac 

agcttaaaag 



ttggnggtaa 
ttcagcaaag 
tataatcgat 
aaaataagta 
ttaagtcact 
attcattgca 
aaaattcagt 



aatgcaaatg 
aaaataatta 
tacactaatc 
aaatgactac 
ctgccacttt 
ggagttacac 
ttaatacaaa 



ttttaaaata 
taatttcaaa 
aatatctaga 
ataaactcaa 
ttaaaataat 
ggctaatcat 
gacatt 



tgtttatttt 
atgcaatccc 
aatatacata 
tttcagggat 
acgattcaca 
tgaaaattat 



gtatgtttta 60 
tggatttgat 120 
gacaaagtta 180 
gagggatcat 240 
tttgcttcaa 300 
gatctttgtt 360 
396 



<210> 128 
<211> 396 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 220, 244, 351, 384 
<223> n « A,T,C or G 

<400> 128 

-gccctttttt ttttttttta aaggcaaata 
taaattgagg agcatccata caggcaagct 
attctgcttt taaaaaggtg ccttaagtta 
aatatataaa aatagatgta tcctggaaga 
tcanaatacg ctttttacac aaagaactac 
acacttagga aaagtgagcc gagcagcctt 
aaagactcca gaaccagccg agtntgtgaa 



aaataagttt attgggatgt aaccccatca 60 
ataaaatctg gaaaatttaa atcaaattaa 120 
accaagcatt ttgataacac attcaaattt 180 
tataatgaan aacatgccat gtgtataaat 240 
aaaaagttac aaagacagcc ttcaggaacc 300 
cacgcaaagc ctccttcaaa naagtctcac 360 
aaagga 396 



<210> 129 
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<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 104, 164, 177, 204, 217, 234, 273, 312, 350, 353, 370 
<223> n = A,T,C or G 

<400> 129 

gccctttttt tttttttttt ttttactcag acaggcaata tttgctcaca tttattctct 60 

tgcatcgtaa atagtagcca actcacaaaa ataaagtata caanaatgta atatttttta 120 

aaataagatt aacagtgtaa gaaggaaaat ctcaaaaaaa gcanatagac aatgtanaaa 180 

attgaaatga aatcccacag taanaaaaaa aaaacanaaa agtgcctatt taanaattat 240 

gctacatgtg gaacttaact agaccatttt aanaaagacc aatttctaat gcaaattttc 300 

tgaggttttc anattttatt tttaaaatat gttatagcta catgttgtcn acncggccgc 360 

tcgagtctan agggcccgtt taaacccgct gatcag 396 

<210> 130 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 23, 24, 26, 32, 56, 191, 286, 355 
<223> n « A, T,C or G 

<400> 130 

cgcccttttt tttttttttt tanngnacgt gnctttattt ctggatgata taaaanaaaa 60 

aacttaaaaa acaccccaaa ccaaacacca atggatcccc aaagcgatgt gactccctct 120 

tcccacccgg ataaatagag acttctgtat gtcagtctac cctcccgccc ccataacccc 180 

ctctgctata nacatactct gggtatatat tactctactc ggcaatagac atctcccgaa 240 

aatagaattc ctgccctgac acctgactct tccctggccg catcanacca cccgccactg 300 

tagcacactg gtgtccttgc cccctgtggt cagggccatg ctgtcatccc acaanaaggc 360 

cacatttgtc acatggctgc tgtgtccacc gtactt 396 

<210> 131 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 49, 68, 69, 83, 88, 93, 136, 140, 154, 158, 166, 167, 168, 
170, 172, 173, 187, 226, 239, 241, 247, 257, 259, 271, 293, 
301, 318, 334, 336, 342, 344, 357, 377, 384 
<223> n = A,T,C or G 

<400> 131 

gccctttttt tttttttttt tttttttttt ttcagtttac acaaaaacnc tttaattgac 60 
agtatacnnt tttccaaaat atnttttngt aanaaaatgc aataattatt aactatagtt 120 
tttacaaaca agtttntcan taaattccag tgtncttnaa accccnnncn annaaaacat 180 
atatganccc ccagttcctg ggcaaactgt tgaacattca ctgcanacaa aaagaccanc 240 
nccaaanagt catctgngnc ctccatgctg ngtttgcacc aaacctgagg gancagctag 300 
ngaccgtgac aaaagctntg ctacagtttt actntngccc tntntgcctc ccccatnatg 360 
tttccttggt ccctcantcc tgtnggagta agttcc 396 
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<210> 132 
<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> -69 

<223> n = A, T,C or G 



<400> 132 

cgcgtcgacc gcggccgtag cagccgggct ggtcctgctg cgagccggcg gcccggagtg 60 
gggcggcgnt atgtaccttc cacattgagt attcagaaag aagtgatctg aactctgacc 120 
attctttatg gatacattaa gtcaaatata agagtctgac tacttgacac actggctcgg 180 
tgagttctgc tttttctttt taatataaat ttattatgtt ggtaaattta gcttttggct 24 0 
tttcactttg ctctcatgat ataagaaaat gtaggttttc tctttcagtt tgaattttcc 300 
tattcagtaa aacaacatgc tagaaaacaa acttttggaa aggcattgta actatttttt 360 
caaatagaac cataataaca agtcttgtct taccct 396 

<210> 133 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 1, 17, 18, 20, 21, 25, 26, 30, 31, 40, 44, 45, 46, 51, 52, 
66, 67, 68, 74, 89, 109, 122, 166, 193, 214, 218, 266, 269, 
291, 307, 315, 348, 375, 378, 379, 386, 393 
<223> n = A,T,C or G 



<400> 133 

ntattacccc tcctggnnan ntggnnatan nctgcaaggn gatnnncccg nngaacttca 60 
ctgatnnncc aatnaaaact gctttaaanc tgactgcaca tatgaattnt aatacttact 120 
tngcgggagg ggtggggcag ggacagcaag ggggaggatt gggaanacaa tagacaggca 180 
tgctggggat gcngcgggct ctatggcttc tgangcgnaa agaaccagct ggggctctag 240 
ggggtatccc cacgcgccct gtagcngcnc attaaacgcg gcgggtgtgg nggttacttc 300 
gcaaagngac cgatncactt gccagcgccc tagctgcccg ctcctttngc tttcttccct 360 
tcctttctcg ccacnttnnc cggctntccc cgncaa 396 

<210> 134 
<2n> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 133, 144, 221, 229, 302, 358 

<223> n = A,T,C or G 



<400> 134 

tttttttttt ttctgctttt tatatgttta 

aaagaaagat tactttcttc cctacaagat 

ctttacaatg ganacacctg gcanacacca 

agtaatagaa ctgatcaata tcttgtgcct 

atcatgccat gatagtcttg ccaaaagtgc 

anacaaactc aaattgaagg acattctaca 

taaaggagac ttaaaagaca tggcaacaat 



aaaatctctc attctattgc tgctttattt 60 
ctttattaat tgtaaaggga aaatgaataa 120 
tcttaaccaa agcttgaagt taacataacc 180 
cctgatatgg ngtactaana aaaacacaac 240 
ataacctaaa tctaatcata aggaaacatt 300 
aagtgccctg tattaaggaa ttattcanag 360 
gcagta 396 
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<210> 135 
<211> 396 
<212> DNA 

<213> Homo sapiens 



<400> 135 

gcgtcgacgc tggcagagcc acaccccaag tgcctgtgcc cagagggctt cagtcagctg 60 

ctcactcctc cagggcactt ttaggaaagg gtttttagct agtgtttttc ctcgctttta 120 

atgacctcag ccccgcctgc agtggctaga agccagcagg tgcccatgtg ctactgacaa 180 

gtgcctcagc ttccccccgg cccgggtcag gccgtgggag ccgctattat ctgcgttctc 240 

tgccaaagac tcgtgggggc catcacacct gccctgtgca gcggagccgg accaggctct 300 

tgtgtcctca ctcaggtttg cttcccctgt gcccactgct gtatgatctg ggggccacca 360 

ccctgtgccg gtggcctctg ggctgcctcc cgtggt 396 

<210> 136 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 18, 185, 188, 191, 193, 396 

<223> n = A,T,C or G 



<400> 136 

ttatgcttcc ggctcgtntg ttgtgtggaa 
acagctatga ccatgattac gccaagctat 

gcatcaagct tggtaccgag ctcggatcca 
gcggncgntc nantctagag ggcccgttta 
tagttgccag ccatctgttg tttgcccctc 
cactcccact gtcctttcct aataaaatga 
tcattctatt ctggggggtg gggtggggca 

<210> 137 
<211> 396 
<212> DNA 

<213> Homo sapiens 



ttgtgagcgg ataacaattt cacacaggaa 60 

ttaggtgaca ctatagaata ctcaagctat 120 

ctagtaacgg ccgccagtgt gctggaattc 180 

aacccgctga tcagcctcga ctgtgccttc 240 

ccccgtgcct tccttgaccc tggaaggtgc 300 

ggaaattgca tcgcattgtc tgagtaggtg 360 

ggacan 396 



<220> 

<221> misc_feature 
<222> 156, 216 
<223> n'«= A,T,C or G 



<400> 137 

tttttttttt ttctgctttg tacttgagtt tatttcacaa aaccacggag aaagatactg 60 
aaatggagct ctttccagcc tccaagcaag gaggccccag cagccagtct ccagcccctt 120 
gagccctttt tgttaggccc acacccaaaa gagganaacc agtgtgtgcg cgaaggtaca 180 
tggcaaggca cttttgaaaa catcccagtt taccgnggtg aaattgaact tactctgaaa 240 
cagatgaaaa gggacatgca aaattgctga gcacatggag gtgtttgtta gtaggtgaaa 300 
atcatgtcct gggtataacc cagcttctcc aggttagggt gagccgccgt ctggatcagt 360 
ggtggcgggc cacacaccag gatgagcgtg gacttc 396 

<210> 138 
<211> 396 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> misc_feature 
<222> 69, 136, 265, 272 
<223> n = A,T,C or G 



<400> 138 

cccttttttt 

ctcctttana 

acacactact 

gccactgtaa 

ctcttgagca 

tcggcaaaat 

gaagccagtg 



ttttttttac 
attttacaga 
taaggngctt 
atgtaagtat 
gcagctcccc 
tcgtggacat 
agacaagact 



aaatgagaaa 

ctaaagcaca 
gcttactcta 
tattaaaaat 
ccaanaatgc 
aatcatcttt 
tcaactttcc 



aatgtttatt 

acccgaaggc 
caactggaaa 
tacaaattgt 
ancaatggta 
caccattaca 
tatatc 



aagaaaacaa 
aattacagtt 
gttgctgaag 
ttggtgatta 
tgtggctcac 
gataaaccat 



tttagcagct 60 

tcaatcatta 120 
tttgtgacat 180 
ttttgatgac 240 
cagctccata 300 
attcctgaag 360 
396 



<210> 139 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_feature 

<222> 51, 105, 126, 147, 210, 212, 236, 241, 258, 263, 348 
<223> n = A,T,C or G 



<400> 139 
ccgccctttt 
gctgaaagga 
tggtanattt 

ggtaaaaaga 
natccaacat 
tgagtgacac 
aggtcacttg 



tttttttttt 
ttccagttcc 
tgcttctctt 

agggcaaagc 
caagctanac 
aggccatact 
gggacgagca 



<210> 140 
<211> 396 
<212> DNA 
<213> Homo sapiens 



ttcacaaaag 
aagcagtcaa 
tggtcanaaa 

aaactggaan 
acnccctcgc 
acatttgcaa 
ggcaaccaca 



cactttttat 
aactcaaccg 
agggtattca 

anacttctac 
tggccactct 
ggaaaaaaat 
gcttca 



ttgaggcaaa 
ttagnggcac 
ggttgtactt 
tctactgaca 
acaggttgct 
gaggcaanaa 



nagaagtctt 60 
tattttgacc 120 
tccccagcag 180 
gggctnttga 240 
gtcccactgc, 300 
acacaggtat 360 
396 



<220> 

<221> misc_feature 

<222> 50, 60, 63, 100, 133, 135, 172, 183, 190, 196, 220, 240, 
262, 266, 273, 278, 293, 327, 332, 34l', 348, 355, 380, 391 
<223> n =» A,T,C or G 



<400> 140 

tttttttttt tttttttttt tttttttctc 
tgngacaaat ttttggtcaa gttgtttcca 
cttaaaactg ccncncccaa aaaaaaaaac 
ttntgaaggn tttacnatgc attgttatca 
ggccaattga aacaaacagt tntganaccg 
caggtattag ggataatatt catttancct 
tgccagctcc agcagccttn ttgtccactg 

<210> 141 
<211> 396 
<212> DNA 

<213> Homo sapiens 



atttaacttt tttaatgggn ctcaaaattn 60 
ttaaaaagtn ctgattttaa aaactaataa 120 
caaaggggtc cacaaaacat tntcctttcc 160 
ttaaccagtn ttttactact aaacttaaan 240 
ttnttccncc actgattaaa agnggggggg 300 
tntgagcttt ntgggcanac ttggngacct 360 
ntttga 396 
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<400> 141 

acgccgagcc 

gtcgtattgg 

ccatcaatga 

cccatggcaa 

atcccatcac 

ctgagtacgt 

tgcagggggg 



acatcgctca 
gcgcctggtc 
ccccttcatt 
attccatggc 
catcttccag 
cgtggagtcc 
agccaaaagg 



gacaccatgg 
accagggctg 
gacctcaact 
accgtcaagg 
gagcgagatc 
actggcgtct 
gtcatcatct 



ggaaggtgaa 
cttttaactc 
acatggttta 
ctgagaacgg 
cctccaaaat 
tcaccaccat 
ctgccc 



ggtcggagtc 
tggtaaagtg 
catgttccaa 
gaagcttgtc 
caagtggggc 
ggagaaggct 



aacggatttg 60 
gatattgttg 120 
tatgattcca 180 
atcaatggaa 24*0 
gatgctggcg 300 
ggggctcatt 360 
396 



<210> 142 
<211> 396 
<212> DNA 
<213> Homo sapiens 



<400> 142 

acgcaggaga 

tagaagccct 

ctgtcgaatt 

ggagccagtg 

tgtgtctgca 

gttccccatg 

aattgtagca 



ggaagcccag 
tttctgagca 
aaatcttgca 
ccgctgaaga 
actttgaact 
gatgaagact 
gaattacaag 



cctgttctac 
tcctctcctc 
tcaccatggt 
gtatctctgt 
acgagaatga 
ctgctgttta 
acaagatgaa 



cagagaactt 
tcctcacacc 
gcacttctgt 
gagcgtgaac 
ggagaaagtt 
cagctttgag 
ggcccg 



gcccaggtca 
tgccactgtc 
ggcctactca 
atttacgagt 
cctttggagg 
gccttggtgg 



gaggtctgcg 60 
ctctgcgttg 120 
ccctccaccg 180 
ttgtggctgg 240 
ccttctttgt 300 
atgggaagaa 360 
396 



<210> 143 
<211> 396 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 19, 48, 69, 122, 183, 227, 332, 390 
<223> n = A,T,C or G 



<400> 143 

tttttttttt tttccatana aaataggatt 

atgttccana aatgttttat gcataacaca 
anaaaaatca aagcctacca ggaaatgctt 
ttntgttagc aacacaggaa ttcacattgt 
aggttcaact gctcctaccc ccatgggccc 
gcttcacagt aaccaggagg aaagatctca 
gtgtttcaaa agccaagctc tggggtctgn 

<210> 144 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 221, 331 
<223> n = A,T,C or G 



tattttcaca tttaaggnga acacaaatcc 60 
tcatgagtag attgaatttc tttaacacac 120 
ccctccggag cacaggagct tacaggccac 180 
ctaggcacag ctcaagngag gtttgttccc 240 
tcctcaaaaa cgacagcagc aaaccaacag 300 
gngggggaac cttcacaaaa gccctgagtt 360 
ggcctg 396 



<400> 144 

tttttttttt tttcgctctt tggtctgaca 
tgggaaaaaa ggtcagtttc aaattcagta 
attcttttct ttgggtactt aagccagctg 
gatcaacaac taatcagtta gttcctcagc 
aaggacatac ttgctctggc ttcaattagc 



agaaaagagt tttaggtgtg tgaagtaggg 60 

acatatggta acactaagtt aggctgctgc 120 

gcacttccac tttgtaacca attatattat 180 

ttcaactgaa nagttcctga ttacctgatg 240 

atgctgtcaa gcatccctct ccatgcttaa 300 
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catggcaaca caaaacccaa gagtccttct ntttttttca ttagccatga ataaacactc 360 
acaaagggga agagtagaca ctgcttttag taaacg 396 

<210> 145 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 45, 56, 61, 63, 120, 122, 147, 151, 158, 259, 262, 274, 339, 
345, 353 

<223> n « A,T,C or G 



<400> 145 

tttttttttt tttttttcaa tggatccgtt agctttacta ctaanatctt gctganatca 60 
nanaagggct tctgggcagg ctgagcactg ggggtgtgca acatggtaac tctgaataan 120 
anaaaccctg agttttactg ggcaaanaaa naacaagngg taggtatgat ttctgaacct 180 
ggaaatagcg aaaatgaagg aaattccaaa agcgcgtatt tccaaataat gacaggccag 240 
caagaggaca ccaaacctnt anaaagaggt attntttctt ccagctactg atggctttgg 300 
catcccacag gcacattcct ttggccttca ggatcttana tgcanatgtg ganagtcaag 360 
aggtaggctg actctgagtc ttcagctaaa ttcttt 396 

<210> 146 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 120, 130, 176, 180, 185, 208, 238, 254, 259, 261, 275, 285, 
296, 347 

<223> n = A,T,C or G 



<400> 146 

tttttttttt ttttcattag caaggaagga 
cagccgggat ttttggaaca ctacctttgt 
aataaatcan aagcgacttt aaatctccct 
acaanaaaac agtggcttta caaaaaanat 
gtgaggcaca ctgngggana nacaaggtcc 
ggctgactcc ctgctctccc gcattctctc 
gttgaggtct gatcatttct acccagggaa 

<210> 147 
<211> 396 
<212> DNA 

<213> Homo sapiens 



tttatttttt cttttgaggg gagggcggaa 60 
ctttcacttt gttgtttgtg tgttaacacn 120 
tcgcaggact gtcttcacgt atcagngcan 180 
gttcaagtag gctgcacttt gcctctgngg 240 
cctgnaacca gaggngggaa ggacanagct 300 
ctccatgtgt tttgaanagg gaagcaacat 360 
cctgtt 396 



<400> 147 

acggggaagc caagtgaccg tagtctcatc 
agcagacatc attgtcagtg agcttctggg 
gtgcctggat ggagcccagc acttcctaaa 
cacttccttt ctggctccca tctcttcctc 
ggagaaggac cgtgaccctg aggcccagtt 
cttccaccag ctctctgcac cccagccctg 
tatgattgac aacaaccgct attgcacctt 



agacatgagg gaatgggtgg ctccagagaa 60 
ctcatttgct gacaatgaat tgtcgcctga 120 
agatgatggt gtgagcatcc ccggggagta 180 
caagctgtac aatgaggtcc gagcctgtag 240 
tgagatgcct tatgtggtac ggctgcacaa 300 
tttcaccttc agccatccca acagagatcc 360 
ggaatt 396 



<210> 148 
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<211> 396 
<212> DNA 
<213> Homo sapiens 



<400> 148 

acgtcccatg attgttccag accatgactc 

agaagcagag gttatgacag ttatgcagac 

ccttgctttt ccactgtttc ttcctgctgc 

gacatgtagc agctgcaggg tttaccacac 

ccttccccac tttgagaata tggcagcccc 

attgaagtag aagcctcaaa gcagactttt 

aggaagatca tgcttgatac ttagattggt 

<210> 149 
<211> 396 
<212> DNA 
<213> Homo sapiens 



ttcctggttg tgggtttgtt acagagcagg 60 
tttccccctc ctttttctct tttctcttcc 120 
cacctgggcc ttgaattcct gggctgtgaa 180 
gtgggagggc agcccagtac tgtccctctg 240- 
tttcattcct ggcttggggt aggggagacc 300 
ccctttactg tgtgtactcc aggacgaaga 360 
tttccc 396 



<220> 

<221> misc_feature 
<222> 214, 295 
<223> n = A,T,C or G 



<400> 149 

tttttttttt tttaaagagt cacattttat 

taaactcttt tatctttaat tttgagaagt 

tagagccggt aaccaggaca cagatttgga 

tttatgtcat acatttcgct tatttttatc 

ttaaaaggaa aacattctga gtaaatttag 

tctacactaa caggtcagga taagtatgga 

catgcggtcc ccagagccac cctctccaag 

<210> 150 
<211> 396 
<212> DNA 
<213> Homo sapiens 



tcaatgccta tttgtacatg ttactagcaa 60 

tttacaaata cagcaaagca gaatgactaa 120 

aaaataggtc taattggttg ttacactgtg 180 

aaanaaaaat cagaatttat aaaatgttaa 240 

tcccgtgttt cttcctccaa atctntttgt 300 

tggggaggct ggaaaaaggg catccttccc 360 
caggac 396 



<400> 150 

acgcctctct tcagttggca cccaaacatc 
agcatctgga cttttcagaa ttgatcttaa 
acaactgtat ccttggaaat atatttctag 
cctgagtcaa atttcagcag acttaccagc 
actgtatctt gaaaaacctg ttgttccttt 
gactcattgc agtaatattt ccaatgtgcc 
tactgatgta cacatgaagg aagtaattca 

<210> 151 
<211> 396 
<212> DNA 
<213> Homo sapiens 



tggattggca aatcagtggc aagaagttcc 60 

gtctactgtc atttccagat gcattatttt 120 

ggagaatatt attgaagaaa atgttaatag 180 

atttgtatca gtggtagcaa atgaagccaa 240 

aaatatgatg ttgccacaag ctgcattgga 300 

acctacaaga gagatacttc aagtctttct 360 
gcagtt 396 



<220> 

<221> misc_feature 
<222> 146, 299, 332 
<223> n = A,T,C or G 

<400> 151 

acaaaatgcc cagcctacag agtctgagaa ggaaatttat aatcaggtga atgtagtatt 60 
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aaaagatgca gaaggcatct tggaggactt 
acgagaggca atccagcatc cagcanatga 
tgttccacta gtaggcaaat taaagaaatt 
attaagaggt cttctgggag ccttaacaag 
gcgagagcag gctcttgcta aacagtttgc 
tgaactcaag atgacaaatc ctgccataca 

<210> 152 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_feature 
<222> 249 

<223> n = A, T,C or G 



gcagtcatac agaggagctg gccacgaaat 120 

gaagttgcaa gagaaggcat ggggtgcagt 180 

ttacgaattt tctcagaggt tagaagcagc 240 

taccccatat tctcccaccc agcatctana 300 

anaaattctt catttcacac tccggtttga 360 

gaatga 396 



<400> 152 

acgcagcgct cggcttcctg gtaattcttc 
tgctgggccc tccttgctgc tcgccgccct 
gcgctgcgac acacctgcca actgcaccta 
ggtgggctcc agcggttccc agcgcgatgt 
aaaagtagng gtgtaccttc agaagctgga 
ccatttcacc atcatttaca accaaggctt 
tgcctttttt aagtataaag aagagggcag 

<210> 153 
<211> 396 
<212> DNA 
<213> Homo sapiens 



acctcttttc tcagctccct gcagcatggg 60 

cctgctgctt ctctccggcg acggcgccgt 120 

tcttgacctg ctgggcacct gggtcttcca 180 

caactgctcg gttatgggac cacaagaaaa 240 

tacagcatat gatgaccttg gcaattctgg 300 

tgagattgtg ttgaatgact acaagtggtt 360 

caaggt 396 



<400> 153 

ccagagacaa cttcgcggtg tggtgaactc 

tgactgagac ctagaaatcc aagcgttgga 

atggaacgaa ggcgtttgcg gggttccatt 

acaagcccac ggagacttgt ggagctggca 

gccattgccg ccctggagtt gctgcccagg 

tttgacggga gacacagcca gaccctgaag 

ctccctctgg gagtgctgat gaagggacaa 

<210> 154 

<211> 396 

<212> DNA • 

<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> 42, 45, 59, 82 
<223> n A,T,C or G 



tctgaggaaa aacacgtgcg tggcaacaag 60 

ggtcctgagg ccagcctaag tcgcttcaaa 120 

cagagccgat acatcagcat gagtgtgtgg 180 

gggcagagcc tgctgaagga tgaggccctg 240 

gagctcttcc cgccactctt catggcagcc 300 

gcaatggtgc aggcctggcc cttcacctgc 360 

catctt 396 



<400> 154 

acagcaaacc tcctcacagc ccactggtcc 
acaactacgc attgcctccc tncactcgga 
tggacagtgt cagagtcctg agacagatca 
cctcggacac cgaggagaat gtcaagaggc 
ggaacgagct aaaacggagc ttttttgccc 
atgaaaaggc ccccaaggta gttatcctta 
aagcagagga gcaaaagctc atttctgaag 



tcaagagggg cnacntcttc acacatcanc 60 

aggactatcc tgctgccaag agggtcaagt 120 

gcaacaaccg aaaatgcacc agccccaggt 180 

gaacacacaa cgtcttggag cgccagagga 240 

tgcgtgacca gatcccggag ttggaaaaca 300 

aaaaagccac agcatacatc ctgtccgtcc 360 

aggact 396 
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<210> 155 

<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> iaisc_feature 

<222> 15, 17, 202, 280, 339 

<223> n » A,T,C or G 

<400> 155 

tttttttttt tgaananaca ggtctttaat gtacggagtc tcacaaggca caaacaccct 60 
caccaggacc aaataaataa ctccacggtt gcaggaaggc gcggtctggg gaggatgcgg 120 
catctgagct ctcccagggc tggtgggcga gccgggggtc tgcagtctgt gaggggcctc 180 
ctgggtgtgt ccgggcctct anagcgggtc cagtctccag gatggggatc gctcactcac 240 
tctccgagtc ggagtagtcc gccacgaggg aggagccgan actgcagggg tgccgcgtgt 300 
cgggggtgtc agctgcctcc tgggaggagc ctgctggcna caggggcttg tcctgacggc 360 
tcccttcctg ccccctcggg ctgctgcact tggggg 396 

<210> 156 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 11, 30, 32, 37, 309, 332 

<223> n = A,T,C or G 

<400> 156 

gaaggggggc ngggcagggg cggaatgtan anattantgc catgattgaa gatttaagaa 60 
acgtgagatt caggattttc accacatccc catttagtta gcttgctcgt ttggctggtg 120 
caaatgccag atggattatg aacaatgaca gtaaattaat gcaacataat caggtaatga 180 
tgccaagcgt atctggtgtt ccaggtattg tacctttacc ggaacaaatc agtaaatcca 240 
caatccctgg cacctgttag gcagctatta acctagtaaa tgctccccca tcccatctca 300 
atcagcaang acaatcaaaa acatttgctt tnagtggcag gaacactggt acatttttac 360 
ttgctccaag ggctgtgcca acgctccctc tctctg 396 

<210> 157 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 121, 202, 204, 255, 314, 332, 368 
<223> n » A,T,C or G 



<400> 157 

tttttttttt tttttgggga atgtaaatct 

taaagctttg gcacatacag tataaaaaat 

nttatcaact gcatactaag tgttttcaat 

aaattgataa ataacaggta ananaaagat 

aaaagtgagt actgnggata tttaaaatat 

gtattgcggg ccanacactt aagtgaaagc 

attttggnca tatcatttca aaacatttgc 



tttattaaaa cagttgtctt tccacagtag 60 
aatcacccac cataattata ccaaattcct 120 
acaatttttt ccgtataaaa atactgggaa 180 
atttctaggc aattactagg atcatttgga 240 
cacagtaaca agatcatgct tgttcctaca 300 
anaagtgttt gggtgacttt cctacttaaa 360 
atcttg 396 
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<210> 158 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<400> 158 

tttccgaaga cgggcagctt cagagaagag 
tagactcttt ggcatagact ctttcgcagg 
ccatccccaa actagctgga gcctgatgga 
aaaaatgttc caaaaagtta tctccagaga 
tattctcatt ctttaaacca atacccaggt 
acatggtgtg gctagaaatg aattgagtgt 
taggaggagg cagaggggtg cctggagttt 

<210> 159 
<211> 396 
<212> DNA 
<213> Homo sapiens 



gattattcgg gagattgctg gtgtggccca 60 
cagccactct gagtgtggcc agttctataa 120 
taggaacggg tagtctgtcc tcttccccat 180 
gagtccctta tgaagacagt tgccaagctg 240 
cagggctagt tcacactagc actgttaggg 300 
gacttctccc tacaacccca ggcccaggga 360 
ctgcac 396 



<400> 159 

tccgcgcgtt 

gcggcgaggg 

cccacactga 

ggagcgtcgg 

cacctgatcg 

catggcgcag 

gctcctctct 



gggaggtgta 
cgcgagtgag 
aggtccggaa 
cacctgaacg 
cgagacccca 
ctgtgcgggc 
ggggtcctgg 



gcgcggctct 
gagcagaccc 
aggcgacttc 
cgaggcgctc 
acggctggtg 
tgaggcggag 
cggccgaccg 



gaacgcgctg 
aggcatcgcg 
cgggggcttt 
cattgcgcgt 
gcgtcgcctg 
ccgggcgttt 
agaacg 



agggccgttg 
cgccgagaag 
ggcacctggc 
gcgcgttgag 
cgcgtctcgg 
ctcgccctgc 



agtgtcgcag 60 
gccgggcgtc 120 
ggaccctccc 180 
gggcttcccg 240 
ctgagctggc 300 
tgggatcgct 360 
396 



<210> 160 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 96, 102, 122, 124, 129, 146, 148, 184, 189, 196, 205, 208, 
229, 246, 259, 261, 269, 272, 281, 297, 305, 308, 327, 331, 
337, 338, 339, 343, 346, 354, 366, 367, 369, 378, 379, 380, 
381, 391, 395 
<223> n = A,T,C or G 



<400> 160 

ggaaaccttc 

atatgtcgta 

angnataana 

tganaattnt 

aaaaanccat 

ttccnganat 

cactgnncnc 



tcaactaaga 
cactctacaa 
acttccttta 
ctatangtct 
ttacattgna 
gatgttcttt 
cacatttnnn 



gaacatcatt 
tgtaaatagc 
caaaanantt 
ctcangantg 
nagtgtacna 
tagtttnaga 
naattacacc 



tctggcaaac 
actganccac 
cctgttgttc 
ttcgcaccca 
cntatatttg 
nggttcnnnc 
ncacng 



tatttttgtt 
ancttacaga 
ttaatactcc 
tttcttttnt 
ngagctaaca 
aanttnctac 



agctcacaat 60 
aggtaaaaag 120 
ccattgctta 180 
aacttctact 240 
aaaaatngtt 300 
tccngcccgc 360 
396 



<210> 161 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 271, 273, 325, 364 
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<223> n = A,T,C or G 



<400> 161 

tttttgtttg 

gcttaaaaca 

tatttgttga 

agaggataag 

ttttatttct 

ctcccaacat 

ctanagttca 



attattttta 
tttatgagta 
tgggaaatca 
cacaaccata 
tggtatacct 
tttaaaaaaa 
gctgagtaag 



<210> 162 
<211> 396 
<212> DNA 

<213> Homo sapiens 



ttataatgaa 
ctgcaaggac 
ttgtgaaagc 
tgggaggaga 
tcccaaaaca 
ctgcnccccc 
ccactgtgga 



attaaactta 
taacagaaac 
aaacctccaa 
taaccagtct 
nanacattca 
caatgggtga 
gcctta 



tgactattac 
aggaaaaatc 
atattcattt 
ctcccttcat 
acagtagtta 
acaaagtaaa 



agtatgctca 60 
ctactaaaaa 120 
gtaagccata 180 
atatattctt 240 
gaatggccat 300 
gagtagtaac 360 
396 



<220> 

<221> misc_feature 

<222> 33, 38, 51, 62, 71, 72, 88, 97, 98, 100, 106, 142, 155, 160, 
161, 163, 168, 170, 174, 183, 190, 194, 203, 214, 216, 231, 
232, 241, 242, 252, 258, 260, 264, 265, 267, 276, 278, 282, 
287, 289, 292, 295, 297, 301, 311, 319, 322, 325 

<223> n = A, T,C or G 

<221> misc_feature 

<222> 330, 337, 341, 342, 347, 348, 354, 356, 361, 367, 368, 375, 
379, 385, 391, 394, 395 
<223> n « A,T,C or G 



<400> 162 

tttttttttt tttttttttt tttttttttt 
angggacaaa nnaaaaaact taaggggntg 
aaaccccaac atgcatgccc tnccttgggg 
aantaacccn aggnttaact ttnattatca 
nnttccccca anccaaantn gggnncnccc 
nattttttga ngggtttcnc cngcncattn 
ngggggnnat ttttntcang ggccnatttg 

<210> 163 
<211> 396 
<212> DNA 

<213> Homo sapiens 



ttnggggncc aaattttttt ntttgaagga 60 
ttttggxincn acttanaaaa aagggaaagg 120 
accanggaan ncnccccncn ggtntgggga 180 
ctgncnccca gggggggctt nnaaaaaaaa 240 
attttncnca anttggncnc cnggncnccc 300 
agggaanggg nntcaannaa accncncaaa 360 
ngcnnt 396 



<400> 163 

cactgtccgg ctctaacaca gctattaagt 
cagtttctga ggtcagacga gtgtctgcga 
tttctgcttt catgctcagc acatcatctt 
ttttcttcca atagaaaatt ccgcttgacc 
tggcccaagt ctgagccgga ggcccttgtt 
gcccttgaag aacaaggctg acctgagagg 
gcccagggta ggcctggcat gaggggttag 

<210> 164 
<211> 396 
<212> DNA 

<213> Homo sapiens 



gctacctgcc tctcaggcac tctcctcgcc 60 

tgtcttcccg cactctattc ccccagcctc 120 

cctaggcagt ctcttcccca aagtctcacc 180 

tttggtgcac tgcccacttc ccagctccac 240 

ttgggggcgg ggggagagtt ggatgtgatt 300 

ttcctggcgc cctgaggtgg ctcagcacct 360 

gtcagc 396 



<400> 164 
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gacacgcggc 
tacgtgccag 
tacaatgact 
acttctgctc 
acagtcacag 
caccacaact 
cagggattca 



ggtgtcctgt 
acgacggact 
ttctcattct 
tgaccaagaa 
aggctgggat 
gtacacctga 
tcacagaccc 



gttggccatg 
cacagcacag 
ccctgggtac 
aatcactctt 
ggccatagca 
attccaggcc 
tgtggtcctc 



gccgactacc 
cagctcttca 
atcgacttca 
aagaccccac 
atggcgctta 
aatgaagttc 
agcccc 



tgattagtgg 
actgcggaga 
ctgcagacca 
tggtttcctc 
caggcggtat 
ggaaagtgaa 



gggcacgtcc 60 
cggcctcacc 120 
ggtggacctg 180 
tcccatggac 24 0 
tggcttcatc 300 
gaaatatgaa 360 
396 



<210> 165 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc feature 

<222> 29, 33, 55, 51, 65, 11, B2, 87, 98, 101, 103, 114, 118, 124, 
169, 171, 173, 183, 186, 188, 216, 219, 227, 230, 242, 243, 
245, 252, 265, 273, 290, 296, 321, 324, 332, 338, 340, 342, 
345, 359, 372, 380 
<223> n = A,T,C or G 



<400> 165 

tttttttttt tttttttttt ttttttcang 
aaccnccggg gatctancct gnggccnccc 
aacncttggg ggaaagggag gcaaaaaaaa 
agntananct gccaacaggg ctccagggag 
gnncnaactc cncggggggg gggcnctaac 
ggcctcaacc tgcctgagct nacncaaggg 
agggcttgcc tncccacagn ttacttggcc 

<210> 166 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 151, 255 
<223> n = A, T,C or G 



ggncactgag gctttttatt ttgancncaa 60 
cggaaatnac ncnaggctca catnactnta 120 
caatgacttg ggccaattnc ncnactgcaa 180 
cttggnttnt gtaaaanttn taaggaagcg 240 
tancagggac ccctgcaagn gttggncggg 300 
gnggggtntn tntanccaac aggggaccna 360 
aagggg 396 



<400> 166 

ttttttcaaa ttcagagcat ttttattaaa 

aatttttcaa atgaaaaccc ttcaaacggt 
aattacggaa taatttaact ttttaaaata 
ttctccccaa ataaatgttt tcttagtttt 
caatattata atgtncactt ccttaaaaat 
gtttaaccag tatattaacc agtatattaa 
gcttggtatt ttgtattgct tattgcatgc 

<210> 167 
<211> 396 
<212> DNA 
<213> Homo sapiens 



agaacaaaat attaaggcac aaaatacatc 60 
tatgtcctac attcaacgaa acttcttcca 120 
naaaaataca agttcttaaa tgcctaaaat 180 
aatgaagtct cttcatgcag tactgagctc 24 0 
ctagttttgc cacttatata cattcaatat 300 
ccaatatgtt aaacttcttt taagtataag 360 
tttgat 396 



<400> 167 

tggcggcagc ggcggtggcg gtggctgagc 
ggacacaatg tttgcacgag gactgaagag 
gggagccctg gccggcttga agacagtgtc 



agaggacccg gcgggcggcc tcgcgggtca 60 
gaaatgtgtt ggccacgagg aagacgtgga 120 
ctcatacagc ctgcagcggc agtcgctcct 180 
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ggacatgtct ctggtgaagt tgcagctttg 
ctcagtcctc attgccaaca cggtccggca 
gtggcgcaca gtggcacccc aggctgcaga 
ggagatcctg tgccgtgcag cgtgggggca 



ccacatgctt gtggagccca atctgtgccg 240 
gatccaagag gagatgacgc aggatgggac 300 
gcgggcgccg ctcgaccgct tggtctccac 360 
agaggg 396 



<210> 168 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<400> 168 

taggatggta agagtattat aaggattggt acaaggcatg atgagtcctt ttgcttttag 60 
gcttttgact tctggtttta gactttcttt agcttctgtt gttagacaac attgtgcaag 120 
cttggttttt ataagtttgc atggattaaa ctgaacttaa tgaaattgtc cctcccccca 180 
aattctcagc acaattttta ■ ggcccacaag gagtcaagca cctcaaggag atcttcagtt 240 
tgaacttggt gtagacacag ggatactgat gaatcaatat tcaaattagc tgttacctac 300 
ttaagaaaga gaggagacct tggggatttc gaggaagggt tcataaggga gattttagct 360 
gagaaatacc atttgcacag tcaatcactt ctgacc 396 

<210> 169 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<222> 16, 58, 76, 84, 99, 111, 114, 124, 136, 140, 161, 167, 184, 
189, 204, 206, 210, 228, 230, 232, 243, 275, 277, 289, 301, 
303, 312, 319, 321, 323, 325, 333, 345, 349, 355, 359, 364, 
365, 372, 375, 377, 379, 383, 387, 389, 394, 396 
<223> n = A,T,C or G 

<400> 169 

tttttttttt tttcanaatt aaattcttta atacaaaatg cttttttttt tttaaaanat 60 

atctgtattt ctttgncgtt gttnaaaaat aaatatgtnc tacggaatat ntcnaaaaac 120 

tgcnctaaaa acaaanacgn gatgttaata tcttttcccc ncaattntta cggataaaca 180 

gtanccccna taaataaatg atancnaatn ttaaaattaa aaaagganan anatttagta 240 

tgnaaaattc tctatttttt cttggtttgg ttttncntat aaaaaacana atagcaatgt 300 

ntnttttatc anaatcccnt ntntncctaa acnttttttt ttttntttnc ccccnaatnc 360 

aagnngccaa anatntntnt agnatgnana tgtntn 396 

<210> 170 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<400> 170 

tgagaagtac catgccgctt ctgcagagga acaggcaacc atcgaacgca acccctacac 60 
catcttccat caagcactga aaaactgtga gcctatgatt gggctggtac ccatcctcaa 120 
gggaggccgt ttctaccagg tccctgtacc cctacccgac cggcgtcgcc gcttcctagc 180 
catgaagtgg atgatcactg agtgccggga taaaaagcac cagcggacac tgatgccgga 240 
gaagctgtca cacaagctgc tggaggcttt ccataaccag ggccccgtga tcaagaggaa 300 
gcatgacttg cacaagatgg cagaggccaa ccgtgccctg gcccactacc gctggtggta 360 
gagtctccag gaggagccca gggccctctg cgcaag 396 

<210> 171 
<211> 396 
<212> DNA 
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<213> Homo sapiens 



<220> 

<221> raisc_feature 

<222> 133, 224, 260, 264, 268, 279, 283, 317, 322, 338, 360, 370, 
371, 378 

<223> n = A,T,C or G 



<400> 171 

ggtcctcgtc 

aagtgctaaa 

catcatcatc 

cagtgtctca 

gagagatgtt 

tggcttcccc 

cttggtttgn 



gtggtgagcg 
gccgctgagt 
ctnctcagct 
agaccttgcc 
agaagatggn 
ccttaancag 
naaaaaanct 



<210> 172 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 239, 242, 244, 
385, 387, 388, 395 
<223> n = A,T,C or G 



cagccactca 
gaagtaagaa 
ggtggggtca 
ccaccacgga 
tctnaaanat 
gnaaaacatg 
ggaggcttga 



ggctggtcct 
ctctgctaga 
agtgggaagt 
aagccttttt 
tcctctgcna 
atcagcangt 
caatgg 



gggggtgggg 
gaggaaaatg 
tctgtcactg 
cacntacccc 
atntgttttt 
tgctcggatg 



ctgtagggga 60 
ggcttgcttt 120 
ggatctggtt 180 
aaaggacttg 240 
agctatcaag 300 
gaaaaactan 360 
396 



246, 249, 257, 260, 314, 329, 355, 372, 378, 



<400> 172 

agccttgggc caccctcttg gagcatctgg 
actggagaga atgggcagaa gtcgtggtgt 
aatagcaatg tgtgttcaga gagaatgaat 
tttaaactaa taaagtacta ggttgcaata 
cnantntana gggcccnttn aaacccgttg 
gccatctgtt gttngcccct cccccgtgnc 
gtctttccta anaaaaanga agaantnncc 

<210> 173 
<211> 396 
<212> DNA 
<213> Homo sapiens 



ctgtcgaatt cttgtgaccc tgttacacac 60 
tgcagccctg tgcattgggg gtgggatggg 120 
tgcttaaact ttgaacaacc tcaatttctt 180 
tgtgaaaaaa aaaaaaaaag ggcggccgnt 240 
atcaacctcg actgtgcctt ctagttgcca 300 
tttcttgacc ttgaaagggg ccccncccct 360 
ttccnt 396 



<220> 

<221> misc_feature 

<222> 209, 210, 232, 244, 270, 275, 284, 341, 343, 349, 359, 364, 
368, 376, 380, 382, 388, 389, 390, 392 
<223> n « A,T,C or G 

<400> 173 

aagcatgtgg atatgtttag ctacgtttac tcacagccag cgaactgaca ttaaaataac 60 

taacaaacag attcttttat gtgatgctgg aactcttgac agctataatt attattcaga 120 

aatgactttt tgaaagtaaa agcagcataa agaatttgtc acaggaaggc tgtctcagat 180 

aaattatggt aaaattttgc aggggacann ctttttaaga cttgcacaat tnccggatcc 240 

tgcnctgact ttggaaaagg catatatgtn ctagnggcat gganaatgcc ccatactcat 300 

gcatgcaaat taaacaacca agtttgaatc tttttggggg ngngctatnc tttaacccng 360 

tacnggcntt attatntaan gnccctgnnn cntgtg 396 



<210> 174 
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<211> 924 
<212> DNA 
<213> Homo sapiens 



<400> 174 

cctgacgacc cggcgacggc gacgtctctt 
cctaggagtc tacggggacc gcctcccgcg 
ggaaaatcat ccgatcggaa aacttcgagg 
tgctgaggaa gattgctgtg gctgcagcgt 
gagacacttt ctacatcaaa acctccacca 
ttggggagga gtttgaggag cagactgtgg 
gggagagtga gaataaaatg gtctgtgagc 
cctcgtggac cagagaactg accaacgatg 
acgttgtgtg caccagggtc tacgtccgag 
aagcccacca ctggccatgc tcaccgccct 
cttctaggat agcgctcccc ttaccccagt 
cagggtcttg ctttctttga cctcttctct 
gcaagagccc agatcaccca ttccgggttc 
gccccaaacc agcccagagc agggtctctc 
gactggccct ctagcttcta ccctttgtcc 
tgttaatttt attaaaatgc ttta 

<210> 175 

<211> 3321 

<212> DNA 

<213> Homo sapiens 



ttgactaaaa gacagtgtcc agtgctccag 60 
ccgccaccat gcccaacttc tctggcaact 120 
aattgctcaa agtgctgggg gtgaatgtga 180 
ccaagccagc agtggagatc aaacaggagg 240 
ccgtgcgcac cacagagatt aacttcaagg 300 
atgggaggcc ctgtaagagc ctggtgaaat 360 
agaagctcct gaagggagag ggccccaaga 420 
gggaactgat cctgac.catg acggcggatg 480 
agtgagtggc cacaggtaga accgcggccg 540 
gcttcactgc cccctccgtc ccaccccctc 600 
cacttctggg ggtcactggg atgcctcttg 660 
cctcccctac accaacaaag aggaatggct 720 
actccccgcc tccccaagtc agcagtccta 780 
taaaggggac ttgagggcct gagcaggaaa 840 
ctgtagccta tacagtttag aatatttatt 900 

924 



<400> 175 

atgaagattt tgatacttgg tatttttctg 
gaaaagcatt attacattgg aattattgaa 
gaaaagaaac ttatttctgt tgacacggaa 
gatagaattg ggagactata taagaaggcc 
aggacaacta tagaaaaacc ggtctggctt 
actggagata aagtttatgt acacttaaaa 
tcacatggaa taacttacta taaggaacat 
gattttcaaa gagcagatga caaagtatat 
gccactgaag aacaaagtcc tggggaagga 
tcccacattg atgctccaaa agatattgcc 
aaaaaagatt ctctagataa agaaaaagaa 
ttttctgtgg tggatgaaaa tttcagctgg 
tcagaaccag agaaagttga caaagacaac 
tctgtgaatg gatacacttt tggaagtctc 
gtaaaatggt acctttttgg tatgggtaat 
gggcaagcac tgactaacaa gaactaccgt 
ctgtttgatg cttatatggt ggcccagaac 
ctaaaccatc tgaaagccgg tttgcaagcc 
tcatc^aagg ataatatccg tgggaagcat 
atcatctgga actatgctcc ctctggtata 
cctggaagtg actcagcggt gttttttgaa 
aaaaagctgg tttatcgtga gtacacagat 
cctgaagaag agcatcttgg catcctgggt 
atcagagtaa ccttccataa caaaggagca 
agattcaata agaacaacga gggcacatac 
agtgtgcctc cttcagcctc ccatgtggca 
gtccccaaag aagtaggacc cactaatgca 
tctgctgtgg atcccactaa agatatattc 
aagaaaggaa gtttacatgc aaatgggaga 
tttcctacag tatttgatga gaatgagagt 



tttttatgta gtaccccagc ctgggcgaaa 60 
acgacttggg attatgcctc tgaccatggg 120 
cattccaata tctatcttca aaatggccca 180 
ctttatcttc agtacacaga tgaaaccttt 240 
gggtttttag gccctattat caaagctgaa 300 
aaccttgcct ctaggcccta cacctttcat 360 
gagggggcca tctaccctga taacaccaca 420 
ccaggagagc agtatacata catgttgctt 480 
gatggcaatt gtgtgactag gatttaccat 540 
tcaggactca tcggaccttt aataatctgt 600 
aaacatattg accgagaatt tgtggtgatg 660 
tacctagaag acaacattaa aacctactgc 720 
gaagacttcc aggagagtaa cagaatgtat 780 
ccaggactct ccatgtgtgc tgaagacaga 840 
gaagttgatg tgcacgcagc tttctttcac 900 
attgacacaa tcaacctctt tcctgctacc 960 
cctggagaat ggatgctcag ctgtcagaat 1020 
tttttccagg tccaggagtg taacaagtct 1080 
gttagacact actacattgc cgctgaggaa 1140 
gacatcttca ctaaagaaaa cttaacagca 1200 
caaggtacca caagaattgg aggctcttat 1260 
gcctccttca caaatcgaaa ggagagaggc 1320 
cctgtcattt gggcagaggt gggagacacc 1380 
tatcccctca gtattgagcc gattggggtg 1440 
tattccccaa attacaaccc ccagagcaga 1500 
cccacagaaa cattcaccta tgaatggact 1560 
gatcctgtgt gtctagctaa gatgtattat 1620 
actgggctta ttgggccaat gaaaatatgc 1680 
cagaaagatg tagacaagga attctatttg 1740 
ttactcctgg aagataatat tagaatgttt 1800 
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acaactgcac ctgatcaggt ggataaggaa 
cactccatga atggattcat gtatgggaat 
tcggtcgtgt ggtacttatt cagcgccgga 
tcaggaaaca catatctgtg gagaggagaa 
acaagtctta cgctccacat gtggcctgac 
acaactgatc attacacagg cggcatgaag 
cagtctgagg attccacctt ctacctggga 
gtggaatggg attattcccc acaaagggag 
cagaatgttt caaatgcatt tttagataag 
aaagttgtgt atcggcagta tactgatagc 
gaagaagaac atctgggaat tctaggtcca 
aaaattatct ttaaaaacat ggccacaagg 
acagagagtt ctacagttac tccaacatta 
atcccagaaa gatctggagc tggaacagag 
tcaactgtgg atcaagttaa ggacctctac 
cgaagacctt acttgaaagt attcaatccc 
ctagtttttg atgagaatga atcttggtac 
caccccgaga aagtaaacaa agatgatgag 
attaatggaa gaatgtttgg aaacctacaa 
aactggtatc tgatgggaat gggcaatgaa 
catagcttcc aatacaagca caggggagtt 
ggaacatacc aaaccctaga aatgtttcca 
catgtgaccg accacattca tgctggaatg 
gacaccaaat ctggctgaat gaaataaatt 
tgattcataa caatgtatgt gaaagtgtaa 
cattaaaaga gactggagca t 

<210> 176 
<211> 487 
<212> DNA 

<213> Homo sapiens 
<400> 176 

gaaatacttt ctgtcttatt aaaattaata 
ttacagcaga catggaaata taattttaaa 
tcaccactgt tatattacct tctccaggaa 
tttccttgta tgcaaagttt ttgttgaaag 
ggagaaaact gcatcataac tttacagaat 
aaacttctct gcagtatctg gcttgtccat 
catgagtcag tttgtgccca tgaataatac 
gtatttttaa ggtcaatata ctgtacattt 
aaaaaaa 

<210> 177 
<211> 3999 
<212> DNA 

<213> Homo sapiens 



gatgaagact ttcaggaatc taataaaatg 1860 
cagccgggtc tcactatgtg caaaggagat 1920 
aatgaggccg atgtacatgg aatatacttt 1980 
cggagagaca cagcaaacct cttccctcaa 2040 
acagagggga cttttaatgt tgaatgcctt 2100 
caaaaatata ctgtgaacca atgcaggcgg 2160 
gagaggacat actatatcgc agcagtggag 2220 
tgggaaaagg agctgcatca tttacaagag 2280 
ggagagtttt acataggctc aaagtacaag 2340 
acattccgtg ttccagtgga gagaaaagct 24 00 
caacttcatg cagatgttgg agacaaagtc 24 60 
ccctactcaa tacatgccca tggggtacaa 2520 
ccaggtgaaa ctctcactta cgtatggaaa 2580 
gattctgctt gtattccatg ggcttattat 2640 
agtggattaa ttggccccct gattgtttgt 2700 
agaaggaagc tggaatttgc ccttctgttt 2760 
ttagatgaca acatcaaaac atactctgat 2820 
gaattcatag aaagcaataa aatgcatgct 2880 
ggcctcacaa tgcacgtggg agatgaagtc 2940 
atagacttac acactgtaca ttttcacggc 3000 
tatagttctg atgtctttga cattttccct 3060 
agaacacctg gaatttggtt actccactgc 3120 
gaaaccactt acaccgttct acaaaatgaa 3180 
ggtgataagt ggaaaaaaga gaaaaaccaa 3240 
aatagaatgt tactttggaa tgactataaa 3300 

3321 



aattattggt ctttacaaga cttggataca 60 
aaatttctct ccaacctcct tcaaattcag 120 
ccctccagtg gggaaggctg cgatattaga 180 
ctgtgctcag aggaggtgag aggagaggaa 240 
tgaatctaga gtcttccccg aaaagcccag 300 
ctggtctaag gtggctgctt cttccccagc 360 
acgacctgtt atttccatga ctgctttact 420 
gataataaaa taatattctc ccaaaaaaaa 480 

487 



.<400> 177 

caagattcca catttgatgg ggtgactgac 
actgccaagt acagactcac attttatggg 
taccctcgtc gggccaacca ctggtctgcg 
gtactgtggg aatatggagg atatgccagc 
tcacccgtga aaatggagga agaaattcga 
aaagccaaag cccaatggcc agcctggcag 
gaattttccg tggacagaac gcgccattta 
cccgactgga acgtaggctt atctgcagaa 
cagaaggtgg tgcaagacct gattccctgg 



aaacccatct tagactgctg tgcctgcgga 60 
aattggtccg agaagacaca cccaaaggat 120 
atcatcggag gatcccactc caagaattat 180 
gaaggcgtca aacaagttgc agaattgggc 240 
caacagagtg atgaggtcct caccgtcatc 300 
cctctcaacg tgagagcagc accttcagct 360 
atgtccttcc tgaccatgat gggccctagt 420 
gatctgtgca ccaaggaatg tggctgggtc 4 80 
gacgctggca ccgacagcgg ggtgacctat 540 
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gagtcaccca acaaacccac cattccccag 
catcctcaga gtcctttcta tgacccagag 
gtcatcgaga gaatcgcacg gaagggtgaa 
gatattgtag ctgacctggc tccagaagag 
atctactcca actggtcccc atggtccgcc 
aggatgcgac agcgcatgct gaaagcacag 
caggacttcc agccctgcat gggccctggc 
atgtccgagt ggatcacctg gtcgccctgc 
cgggagaggt atgtgaagca gttcccggag 
gaaacggaga agtgcacggt caacgaggag 
tggggcgagt gggacgagtg cagcgccacc 
atgatcaaga tgaaccccgc agatggctcc 
aagtgcatga tgccagagtg ccacaccatc 
agtgactgca gcgtgacctg cgggaagggc 
ctggcagaac ttggagactg caatgaggat 
gaatgcccca ttgactgtga gctcaccgag 
tgtgggaaag gccacgtgat tcgaacccgg 
gcaccctgcc cagagactgt gcagcgaaaa 
ccatccatcc aaaagctacg ctggagggag 
aaggaagagt ctgaagggga gcagttccca 
tcagaatgca ccaaactgtg cggaggtgga 
agattcaaaa gctcccagtt taccagctgc 
gttcatcctt gttagcaagg gtacgagttc 
ccaatggctg gattatttgc ttgtttaaga 
tttgcagtgt ggttcgccca gtagtcttgt 
cttgatgggt acaggctgag tggggcgccc 
agtagtgtca gccaccttgt actaagctga 
agggaggacg gagactttga cctactccac 
atgcctgagt cccagggtgc ggcaggtagg 
ccacattctc atctttggcc tgttcaatga 
aactttaggt ctcttttcaa gtctcctcag 
tggtagactt gaagaggagc attgatgttg 
ggaatacatt tgtctcaccc ctgatattgg 
aataaattat ggctgcttta tttaaatata 
aataagctta gagtgtattt ttcccttgct 
ttctgggaag ccagccttct gaactttttg 
ggaagcaaaa ttggtctctt tagagaccaa 
catctagacg ttcaagtttg caaatcagtt 
cattttgcta agtctgccca aagccccccc 
gtactttaaa gttattttag tcatgaaatt 
agaaaacaaa tctaagggaa aggaatatta 
gaaagtcaaa cctgtcagtg ctccacacca 
atggccacag gtttacactg ccttcccagc 
gaccaccaag ggatagtgta aaaggacatt 
tcctgcattt attgttgaaa actattgttt 
ttaatccaaa tgtgtaccat tggtgagaca 
gtattaaaca catcagaata tttccaaata 
taacacaaga gagactattc aataaaaact 
aagtgttcag aaggttcttt tttatattgt 
agggcttttg ctcccttgtt cttggaggga 
tccaacatgt tttaaatcct taaatgaatt 
aaacacaaga tttaattatt tttctacttg 
cacttttgca atgttgttct aagctatcta 
aagctggtga ttcctaatca aggacaagcc 
cagttgggta cattttaaaa tcctgatttt 
atgggtaaca tgactettgt tggattgtta 
agcatcaagt ctctttctta ccaaagtctt 
aaatcatttt ggaaataaag attttttact 



gagaaaatcc ggcccctgac cagcctggac 600 
ggtgggtcca tcactcaagt agccagagtt 660 
caatgcaata .ttgtacctga caatgtcgat 720 
aaagatgaag atgacacccc tgaaacctgc 780 
tgcagctcct ccacctgtga caaaggcaag 840 
ctggacctca gcgtcccctg ccctgacacc 900 . 
tgcagtgacg aagacggctc cacctgcacc 960 
agcatctcct gcggcatggg catgaggtcc 1020 
gacggctccg tgtgcacgct gcccactgag 1080 
tgctctccca gcagctgcct gatgaccgag 1140 
tgcggcatgg gcatgaagaa gcggcaccgc 1200 
atgtgcaaag ccgagacatc acaggcagag 1260 
ccatgcttgc tgtccccatg gtccgagtgg 1320 
atgcgaaccc gacagcggat gctcaagtct 1380 
ctggagcagg tggagaagtg catgctccct 1440 
tggtcccagt ggtcggaatg taacaagtca 1500 
atgatccaaa tggagcctca gtttggaggt 1560 
aagtgccgca tccgaaaatg ccttcgaaat 1620 
gcccgagaga gccggcggag tgagcagctg 1680 
ggttgtagga tgcgcccatg gacggcctgg 1740 
attcaggaac gttacatgac tgtaaagaag 1800 
aaagacaaga aggagatcag agcatgcaat 1860 
cccagggctg cactctagat tccagagtca 1920 
caatttaaat tgtgtacgct agttttcatt 1980 
ggatgccaga gacatccttt ctgaatactt 2040 
tcacctccag ccagcctctt cctgcagagg 2100 
aacatgtccc tctggagctt ccacctggcc 2160 
atggagaggc aaccatgtct ggaagtgact 2220 
aaacattcac agatgaagac agcagattcc 2280 
aaccattgtt tgcccatctc ttcttagtgg 2340 
tcatcaatag ttcctgggga aaaacagagc 2400 
ggtggctttt gttctttcac tgagaaattc 24 60 
ttcctgatgc ccccccaaca aaaataaata 2520 
aggtagctag tttttacacc tgagataaat 2580 
tttgggggtt cagaggagta tgtacaattc 2640 
gtactaaatc cttattggaa ccaagacaaa 2700 
tttgcctaaa ttttaaaatc ttcctacaca 27 60 
tttagcaaga aaacattttt gctatacaaa 2820 
aatgcattcc ttcaacaaaa tacaatctct 2880 
ttatatgcag agagaaaaag ttaccgagac 2940 
tgggattaag ctgagcaagc aattctggtg 3000 
gggctgtggt cctcccagac atgcatagga 3060 
aattataagc acaccagatt cagggagact 3120 
ttctcagttg ggtccatcag cagtttttct 3180 
catttcttct tttataggcc ttattactgc 324 0 
catacaatgc tctgaataca ctacgaattt 3300 
caacatagta tagtcctgaa tatgtacttt 3360 
cactgggtct ttcatgtctt taagctaagt 3420 
cctccacctc catcattttc aataaaagat 3480 
ccattattac atctctgaac tacctttgta 3540 
gctttctccc aaaaaaagca caatataaag 3600 
gggggaaaaa agtcctcatg tagaagcacc 3660 
tctaactctc agcccatgat aaagttcctt 3720 
accctagtgt ctcatgtttg tatttggtcc 3780 
ggagacttaa aaccaggtta atggctaaga 3840 
ttttttgttt gcaatgggga atttataaga 3900 
gttaggtggt ttatagttct tttggctaac 3960 
acaaaaatg 3999 
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<210> 178 

<211> 1069 

<212> DNA 

<213> Homo sapiens 

<400> 178 

aaaaaagatg aataaatgaa taagagagat gaataaacaa atttacatta catgtgatag 60 
ttatcatggt atggccttca tgacaagatg gatgagaata tcactgatag gatattagcc 120 
ttctttcata tctttatatt gaaatatggg ctttacttca atttgaaggt ctttcatgaa 180 
caataaaaga gagtagaagg actgtctgag aaggcaggag acatataaaa cagatgactg 24 0 
aaagactgac tagctcctgg aaagggaaac atttggaaca tccagagtaa gggcaaatgg 300 
gcttctacca gcacaacaaa gagcctccag gtggcaacat ggaagcaggt tatcagagaa 360 
aataaatgtg caaattcctt atttacaatg actcacttaa ccccacaaac atgtttcact 420 
gctgccttcc ccagttgtcg cttatgtact gttgttacct ttcagttaca tgcctttgat 480 
cctaaaattc tctacttttg gtgccttatc agttctttgc aatctgcctg tggttatcag 540 
cacttaaagc acaattttga aggggaaaaa aatgataatc accttagtcc caaagaaata 600 
atttgtcaaa ctgccttatt agtattaaaa acagacacac tgaatgaagt agcatgatac 660 
gcatatatcc tactcagtat cattggcctt ttatcaaatg gggaaactat acttttgtat 720 
tacatagttt tagaaatcga aagttagaga ctctttataa gtaatgtcaa ggaacagtaa 780 
tttaaaaaca aagttctaac aaatatattg tttgcttaat cacaatgccc tcaacttgta 840 
tttgaataac taaataggac atgtcttcct tggagctgtg ggcattagtt cagaagcact 900 
acctgcatct taattttcaa aacttaagtt ttattagcaa atcctcttct ctgtaagact 960 
tagctatgaa gtggtatatt ttttccaaat atttttctga aaacatttgt tgttgtaact 1020 
gcacaataaa agtccagttg caattaaaaa aaaaaaaaaa aaaaaaaaa 1069 

<210> 179 

<211> 1817 

<212> DNA 

<213> Hoiao sapiens 

. <400> 179 

tgctattctg ccaaaagaca atttctagag tagttttgaa tgggttgatt tcccccactc 60 

ccacaaactc tgaagccagt gtctagctta ctaaaaaaag agttgtatat aatatttaag 120 

atgctgagta tttcatagga aagctgaatg ctgctgtaaa gtgctcttta agtctttttt 180 

ttttttaatc cccttctaat gaatgaaact aggggaattt caggggacag agatgggatt 240 

tgttgtatga taaactgtat gtagttttta gtctttctgt tttgagaagc agtggttggg 300 

gcatttttaa gatggctggc tactcttgtt ttccctcatg ataataaatt tgtcataact 360 

cagtaacatg aacttgcccc tagaggtagt tgttaataat tttgaaatat taaggtcttg 420 

ccaagcttct gatgattcac acctgtacta ctgattatta agcaggacag actgagcttt 480 

ctgttgcaaa taccttggag gagaaagtaa tttctaaata tacagagagg taacttgact 540 

atatatgttg catcctgtgc ctcccttcat attaatattt gataaagatt ttaatttatg 600 

taaaacttct aaagcagaat caaagctcct cttggggaaa tggcaagtct ttaggatagg 660 

caagaccctg tatgaatagt accaaagcat taccgcatgg tagagaacac actcgattaa 720 

aaatgttaag ctatctgaaa aataaaatgt gcaagtcttc aggatggcac aaaacaaagg 7 80 ' 

ttaatgcttc ttggggcaca tttcttagag ggcttgctga gtgtgtaaat ataatcgact 840 

tttgtttgtg ttacatgact tctgtgactt cattgaaaat ctgcacaatt cagtttcagc 900 

tctggattac ttcagttgac ctttgtgaag gtttttatct gtgtagaatg ggtgtttgac 960 

ttgttttagc ctattaaatt tttattttct ttcactctgt attaaaagta aaacttacta 1020 

aaagaaaaga ggtttgtgtt cacattaaat ggttttggtt tggcttcttt tagtcaggct 1080 

ttctgaacat tgagatatcc tgaacttaga gctcttcaat cctaagattt tcatgaaaag 1140 

cctctcactt gaacccaaac cagagtactc ttactgcctc ttttctaaat gtbcaggaaa 1200 

agcattgcca gttcagtctt ttcaaaatga gggagaaaca tttgcctgcc ttgtaataac 1260 

aagactcagt gcttattttt taaactgcat tttaaaaatt ggatagtata ataacaataa 1320 

ggagtaagcc accttttata ggcaccctgt agttttatag ttcttaatct aaacatttta 1380 

tatttccttc ttttggaaaa aacctacatg ctacaagcca ccatatgcac agactataca 14 40 

gtgagttgag ttggctctcc cacagtcttt gaggtgaatt acaaaagtcc agccattatc 1500 

atcctcctga gttatttgaa atgatttttt ttgtacattt tggctgcagt attggtggta 1560 
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gaatatacta taatatggat catctctact tctgtattta tttatttatt actagacctc 1620 

aaccacagtc ttctttttcc ccttccacct ctctttgcct gtaggatgta ctgtatgtag 1680 

tcatgcactt tgtattaata tattagaaat ctacagatct gttttgtact ttttatactg 1740 

ttggatactt ataatcaaaa cttttactag ggtattgaat aaatctagtc ttactagaaa 1800 

aaaaaaaaaa aaaaaaa 1817 



<210> 180 

<211> 2382 

<212> DNA" 

<213> Homo sapiens 



<400> 180 

acttttattg gaagcagcag ccacatccct 
cgggcagcca ctcctgagtg ataaccagta 
ctttatgacg acagcttgtt atggttgcag 
gtaacactcc ttagaaactg gcagtcgtat 
gatcaatttg gataccattt tgtccagatg 
agagagagac tctaagctca agttctggtt 
tgatattaaa gaaatggcct tttattttac 
ttttcctcct tttctttctg aaagtttcct 
cataaaaaat tagtatccct tttgtttggt 
ggtaattaca gcccctaaaa aaaacacatt 
ttagtgtaaa ccaggaattg gcacactttt 
gcttcacggt ccatacagtc tctgtcacaa 
cagctataca caatacagaa atgaatgagt 
aaaacaaggg gaggctgggt ttagcctgtg 
ccttagttat atatgatctg cattttcttg 
agaaaagcca cataatattt agtgtcatta 
gtcaaatact tacctttgga gaatacttac 
agagtcctgg atataggaaa aagtaattta 
aatgatagtc taacaactga gcaagatcct 
gagaccatta accaatactg gaactggtat 
ttatgcttca gaatacagtt gtttgccctg 
tatgtgaagc ttttccaaat agagctctca 
tgcattactt tgagttaaat ttgaatagag 
gtttttgtat tagcccagac atctgtaatg 
ttaaaatcat atccagcaca aaaactattt 
cctacctgta gagatcctcg tcatggaaag 
taagagtgag gccacagttc ccaccacacg 
cctttacttt ctggctgaag catccccttg 
ttcatggaac atagaacaac catgaatgag 
tacttatcta ataatcctct agaaagaacc 
aaagacagaa gacatgagga aaaacaaaag 
cttaacatca gatcttttct ataatatcct 
acacctaaac ctgtattatg aattacatat 
acagtacatt ctagttggaa tcgtttactc 
tcccaggtat agcaggctta tgtttggtgg 
tggcactttt gtaaacagat tgcttctaga 
tgaatactta gatttgtagc ttaatcacat 
ttgaacaaaa attatggcat ttaagaattt 
gttggttggt tttaaaatct ggtaactcca 
atgtctctaa taaagtattc atttgataaa 

<210> 181 
<211> 2377 
<212> DNA 

<213> Homo sapiens 



gcatgatttg cattgcaata caaccataac 60 
taacataaac gtagcagcct caatttttgc 120 
tttgggtctg gctttacgaa gatggcgacc 180 
gttagtttca cttgtctact ttatatgtct 24 0 
caaaaacatt ccaaaagtaa tgtgtttagt 300 
tatttcatgg atggaatgtt aattttatta 360 
atctctcccc tttttccctt tcccccttta 420 
tttatgtcca taaaatacaa atatattgtt 480 
tgctgagtca cctgaacctt aattttaatt 540 
tcaaataggc ttcccactaa actctatatt 600 
tttagaatgg gccagatggt aaatatttat 660 
ctattcagtt ctgctagtat agcgtgaaag 720 
gtggttatgt tctaataaaa cttatttata 780 
ggccatagtt tgtcaaccac tggtgtaaaa 84 0 
aactgatcat tgaaaactta taaacctaac 900 
tgcaataatc acattgcctt tgtgttaata 960 
ctttggagga atgtataaaa tttctcaggc 1020 
tgaagtaaac ttcagttgct taatcaaact 1080 
catctgagag tgcttaaaat gggatcccca 1140 
ctagctactg atgtcttact ttgagtttat 1200 
tgcatgaata tacccatatt tgtgtgtgga 1260 
gaagaattaa gtttttactt ctaattattt 1320 
tattaaatat aaagttgtag attcttatgt 1380 
tttttgcact ggtgacagac aaaatctgtt 14 40 
ctggctgaat agcacagaaa agtattttaa 1500 
gtgccaaact gttttgaatg gaaggacaag 1560 
agggcttttg tattgttcta ctttttcagc 1620 
gagtgccatg tataagttgg gctattagag 1680 
tggcatgatc cgtgcttaat gatcaagtgt 1740 
ctgttagatc ttggtttgtg ataaaaatat 1800 
gtttgaggaa atcaggcata tgactttata 1860 
actactttgg ttttcctagc tccataccac 1920 
tacaaagtca taaatgtgcc atatggatat 1980 
tgctagaatt taggtgtgag attttttgtt 2040 
cattaaattg gtttctttaa aatgctttgg 2100 
ttgttacaaa ccaagcctaa gacacatctg 2160 
tctagacttg tgagttgaat gacaaagcag 2220 
aacatgtctt agctgtaaaa atgagaaagt 2280 
tgatgaaaag aaatttattt tatacgtgtt 2340 
aaaaaaaaaa aa 2382 
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<400> 181 

atctttatgc aagacaagag tcagccatca gacactgaaa tatattatga tagattatga 60 
agaattttct ctgtagaatt atattcttcc tggaacctgg tagagtagat tagactcaaa 120 
ggctttttct tccttttctt actcctgttt tttccactca ctcttcccaa gagatttcct 180 
aaagcttcaa gcttaataag cctaatagtg aaaaataact gaatttaatg gtataatgaa 240 
gttcttcatt tccagacatc tttaattgat cttaaagctc atttgagtct ttgcccctga 300 
acaaagacag acccattaaa atctaagaat tctaaatttt cacaactgtt tgagcttctt 360 
ttcattttga aggatttgga atatatatgt tttcataaaa gtatcaagtg aaatatagtt 420 
acatgggagc tcaatcatgt gcagattgca ttctgttatg ttgactcaat atttaattta 480 
caactatcct tatttatatt gacctcaaga actccatttt atgcaatgca gaccactgag 540 
atatagctaa cattctttca aataattttc cttttctttt ataattcctc tatagcaaat 600 
ttttatgtat aactgattat acatatccat atttatattt cattgattcc aagacatcac 660 
tttttcaatt taacatctct gaaattgtga catttcttgc aactgttggc acttcagatg 720 
cagtgtttaa aattatgctt gaataaatat tacactaatc caactttacc taaatgttta 780 
tgcatctagg caaattttgt tttcttataa agatttgaga gcccatttat gacaaaatat 84 0 
gaaggcgaaa tttaaggaca actgagtcac gcacaactca acatggagcc taactgatta 900 
tcagctcaga tcccgcatat cttgagttta caaaagctct ttcaggtccc catttatact 960 
ttacgtgagt gcgaatgatt tcagcaaacc ctaacttaac taacaagaat gggtaggtat 1020 
gtctacgttt cattaacaaa tttttattat ttttattcta ttatatgaga tccttttata 1080 
ttatcatctc acttttaaac aaaattaact ggaaaaatat tacatggaac tgtcatagtt 1140 
aggttttgca gcatcttaca tgtcttgtat caatggcagg agaaaaatat gataaaaaca 1200 
atcagtgctg tgaaaaacaa ctttcttcta gagtcctctt actttttatt cttctttatc 1260 
atttgtgggt ttttccccct tggctctcac tttaacttca agcttatgta acgactgtta 1320 
taaaactgca tatttaaatt atttgaatta tatgaaataa ttgttcagct atctgggcag 1380 
ctgttaatgt aaacctgaga gtaataacac tactctttta tctacctgga atacttttct 1440 
gcataaaatt tatctttgta agctaactct attaatcagg tttcttctag cctctgcaac 1500 
ctacttcagt tagaattgtc taatactgct ctattaatca ggtttctacc ctctacaacc 1560 
tacttcagtt aaaattgtct aatacagcaa tatttaaaaa aaaaacactg caattgtcaa 1620 
ggatggaaaa tgtgtgattt gtgtaaacaa tttttaccaa ctttacattt tcctacagat 1680 
aaatgtgaaa ttttgataag aagtctacgc aatgacaagt acggtacata aattttatta 174 0 
agaatattga gtataaagta ctttaattct aaattataag aaaatataca tttgcacata 1800 
ttaatataga aattcatttt gtgtatattt aacatagctt ttaaactatt ttacattagc 1860 
tacttcatta tggtttcttg aacttctgaa aaaaattaga aatgtattaa acttatcagt 1920 
aacataaaaa cttattttgt ttcacctaac gaatactgcg tttgtaaaaa taaatttaat 1980 
atagaatata tttttaaatt aaatatttga atataaaata gctctaagaa agaagcaaat 2040 
tatcactgaa catatttctt attatttctg gctttgaatt atacgtaact taaattgtct 2100 
taaatgatac agaatattgg agaatatgat actttcacat aatatactat gaacctgttc 2160 
atataactct gattgactac taacttctgt tttatgtatt tattaaagag ctgacactgt 2220 
agtttgtggt gagatgttta tttttctaac agagcttata acagttagga caaggcattt 2280 
aattaatgca tcattctgtt tagtagtagg tgttaatcaa tatgaaattc tctgttttaa 2340 
aataaaaatg taaaaatcta aaaaaaaaaa aaaaaaa 2377 

<210> 182 

<211> 1370 

<212> DNA 

<213> Homo sapiens 

<400>.182 

tgtgagcatg gtattttgtc tcggaagaaa aaaatatggg tcaggcgcaa agtaagccca 60 
ccccactggg aactatgtta aaaaaaaatt tcaagattta agggagatta cggtgttact 120 
atgacaccag aaaaacttag aactttgtgt gaaatagact ggctaacatt agaggtgggt 180 
tggctatcag aagaaagcct ggagaggtcc cttgtttcaa aggtatggca caaggtaacc 240 
tgtaagccaa agcacccgga ccagtttcta tacatagaca gttacagctg gtttagaccc 300 
cttccccctc tccccacagt agttaagaga acagcagcat aagcagctgg cagaggcaag 360 
gaaagaccag cagagagaaa aaaaggccat ctataccaat tttaagttaa tttagactga 420 
acaaggg.ctt attaatagca aaggataatt gaaatcacaa acttataagg gtttcaacaa 480 
aagtgaagtt tgctaaaagt taacagtgta acatgtatta tggtaacttc taatcttgtg 54 0 
gccttagaca gtctagtcaa aacacataaa gaaagtttgc tttaaaaaaa caatggttat 600 
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cttcaaaaat 
tcctgaaata 
aaaacatgtt 
tttaagcaaa 
agatgcaaat 
ttgttatgct 
gtcctctagt 
gagtaataag 
caagggccat 
gaggaaactt 
agcttatcaa 
gacctttact 
ctatcccttt 

<210> 183 
<211> 2060 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 2003 

<223> n = A,T,C or G 
<400> 183 

gtttcagggg aggagacaag gtttcttgtt tgccgtatat gctcctgcag agaagaggaa 60 
gtgaccgtgg aggccatctg gccctgtgtt ttgatatggc aaaattaatg aatgcaatca 120 
gaagaccttt gagcaagaaa gtaccctgga acaacccaat ttggactgca agtattagtt 180 
gggtcttcca ggtgcctctc acagcagcag tcatggcagc agtgactcta gccatgtcca 240 
tgaccaactg ctgcataaca aatagccccg agactcagca gcttacaaca gggtccccag 300 
cccacagact ggcactggtc catggcttgt taggaacctg actgcgcagc agaaggtgag 360 
tgagcattac tgcctgagct ctgcctcctg tcagatcatc aggggcatta gattctcata 420 
ggagcgtgaa ccctattgca aaccgcgcat gcgaaggatg tacgttgcgt gctccttatg 480 
agaatctaac taatgcctga tgatttgagg tggggcagtt tcatccccaa accatctctc 540 
tcccttcatg tccatggaaa aattgtcttc tacaaaacca gtccgtggtg ccaaaaaggt 600 
tggagactgc tggtttacaa ccgcaatgaa cattcatcat cccacacagt gtcagagggt 660 
cgggaacacg ggtgccctgc ctgtgtgctt ccggttccag atttctcagt gggttgtgat 720 
caaggtatca gcggaggccg tattcatctg caagcttgac caggaataga agagccactt 780 
catgggtggc tcactcagat gccagcaggt cagtgctggt ggctggcagg cagcctcagc 840 
tcctcacctc atggatctct cctgagcaca gttttcctgt ccttacaacc tggtagctgg 900 
cttctccaga gcaggtgact caggagagga caaggtgaga gcccagcacc ttatggtcta 960 
gtctcagaag tcacacgcca tcatttctgc aatgtcattt tggggttcca ggtcagctgt 1020 
atcactgtgg gaggtgagta tatagatgtc ctagaccatt caggctgcta tgacagaaca 1080 
ccatgaactg agtggctcat gaacaacaga aatttcccac agttctgtag gctgggaaat 1140 
ccaagatcaa ggtggcagca ggttcagcgt ctgctaagct cctgcttttc atggattgca 1200 
tcttctcact gtgtcctcac gtgatggaca gagcaaatga gctctcaggc actagtccca 1260 
gccatgagga ctctgctttc atgactcatc actccgcaaa ggcccacctc catcagaaga 1320 
cagctgctaa ctgcagctgc catcctccaa gacgggagac acagaattgg gggacatata 1380 
cattgagatc tgaaaggcct ggacagcaac aggtggggat cgtgggggca tcttggaggg 1440 
tggctgccgc agtaacattt ctgacccatg ctttctgctt gcactcatct cctgcctttg 1500 
atcttcatta tctcargcag tccccacaac gactgtatct aggagttcat tttaccctca 1560 
ttttacagat gaaacgtctc agagggtaat gtgcttgccc agtgtctcac aaatgcaaag 1620 
tcactgaggt aggatttcaa cctaggtcca atcatctctg cagcattagg ggttcaccat 1680 
tgccatagac ttaactgtgt cccccaaaat ttgtatgttg aagccctacc agcctccccc 1740 
ccccaatgtg ctgatgtttg gagaaagggc ctttgggagg taattaggtt tagatgagat 1800 
catgagggtg ggactctcat aatggcatta atgccatcag gtgaagagat accagagacc 1860 
ttgtgtcctc tctctctgca atgtgaggac acagtgagaa ggcagctgtc tgcaagctgg 1920 
gaagagagta ctgaccagga acttaatcag agggcatctt gatcttggac ttcccagcct 1980 



aaaggggaga ggcagaattt atataaaaag 
aattaactgg ttgtttaaag aaaagaatgt 
taaaaaattg tctgcaaaag tcataaaaga 
aaatgttgta taatttaaaa gtaataaggc 
tcctggttga aatggatcaa atattccatc 
tgtgcacatg gcaggccaga ggccctgatt 
cgaccaggcg tggactgcat ggtagctctt 
tcatgccaag ctctctctgc tatatcccaa 
gcagcttctg tctcccaaca ctaagttcac 
agtattcctt ggagacctga agggatgcag 
tcagtcagcc cttgttcatc cccgagtgga 
gggcactctg ccaaataact agtgtggcac 
caccctggca tttcatcaac caaaaaaaaa 



agttatatga taaattcttg 660 
ttgtaataag tcaaaaagtt 720 
aaaaatttta ttaaaaaaat 780 
ctcctgtgta ctattaagac 840 
tgcacattaa acaaaagcaa 900 
gtcccccttc cactaaggtg 960 
ttccaggatt ctacagcctg 1020 
agtctctgcg ggtcagcccc 1080 
ttcgtgtctc tcacggcaga 114 0 
tgagcttaag aattttcaag 1200 
tgtgtggtgg tattgtggtg 1260 
ttgtgcttta gtccatttgg 1320 
aaaaaaaaaa 1370 
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ccagaactct gaaaagttaa tgnctattat 
gagccaaccc caagcttact 

<210> 184 
<211> 3079 
<212> DNA 

<213> Homo sapiens 
<400> 184 

ggcacaaagt tgggggccgc gaagatgagg 
actccggcac tgctggccct ggcgctgccc 
accctggaca aagtgcccaa gtcagagggc 
acgcggcgcg agggctacac cgagttcagc 
aagccgggaa ccagctaccg cgtaacactt 
ttcacattaa ttgccctcag agagaacaga 
accttccaga tcatagacga agaagaaact 
actgaaagca ctccacggag gaggacccgg 
ggaacaggct gcgtgattct gaaggccagc 
gatgagggct ctctgaccaa gaaactttgt 
gacaaaccca tcttagactg ctgtgcctgc 
gggaattggt ccgagaagac acacccaaag 
gcgatcat:cg gaggatccca ctccaagaat 
agcgaaggcg tcaaacaagt tgcagaattg 
cgacaacaga gtgatgaggt cctcaccgtc 
cagcctctca acgtgagagc agcaccttca 
ttaatgtcct tcctgaccat gatgggccct 
gaagatctgt gcaccaagga atgtggctgg 
tgggacgctg gcaccgacag cggggtgacc 
caggagaaaa tccggcccct gaccagcctg 
gagggtgggt ccatcactca agtagccaga 
gaacaatgca atattgtacc tgacaatgtc 
gagaaagatg aagatgacac ccctgaaacc 
gcctgcagct cctccacctg tgacaaaggc 
cagctggacc tcagcgtccc ctgccctgac 
ggctgcagtg acgaagacgg ctccacctgc 
tgcagcatct cctgcggcat gggcatgagg 
gaggacggct ccgtgtgcac gctgcccact 
gagtgctctc ccagcagctg cctgatgacc 
acctgcggca tgggcatgaa gaagcggcac 
tccatgtgca aagccgagac atcacaggca 
atcccatgct tgctgtcccc atggtccgag 
ggcatgcgaa cccgacagcg gatgctcaag 
gatctggagc aggtggagaa gtgcatgctc 
gagtggtccc agtggtcgga atgtaacaag 
cggatgatcc aaatggagcc tcagtttgga 
aaaaagtgcc gcatccgaaa atgccttcga 
gaggcccgag agagccggcg gagtgagcag" 
ccaggttgta ggatgcgccc atggacggcc 
ggaattcagg aacgttacat gactgtaaag 
tgcaaagaca agaaggagat cagagcatgc 
ttccccaggg ctgcactcta gattccagag 
agacaattta aattgtgtac gctagttttc 
tgtggatgcc agagacatcc tttctgaata 
ccctcacctc cagccagcct cttcctgcag 
tgaaacatgt ccctctggag cttccacctg 
cacatggaga ggcaaccatg tctggaagtg 
aggaaacatt cacagatgaa gacagcagat 
tgaaaccatt gtttgcccat ctcttcttag 



ttaagccacg cagtctatgg aattttgtta 2040 

2060 



ctgtccccgg cgcccctgaa gctgagccgg 60 
ctggccgcgg cgctggcctt ctccgacgag 120 
tactgtagcc gtatcctgcg cgcccagggc 180 
ctccgcgtgg agggcgaccc cgacttctac 240 
tcagctgctc ctccctccta cttcagagga 300 
gagggtgata aggaagaaga ccatgctggg 360 
cagtttatga gcaattgccc tgttgcagtc 4 20 
atccaggtgt tttggatagc accaccagcg 4 80 
atcgtacaaa aacgcattat ttattttcaa 540 
gaacaagatt ccacatttga tggggtgact 600 
ggaactgcca agtacagact cacattttat 660 
gattaccctc gtcgggccaa ccactggtct 720 
tatgtactgt gggaatatgg aggatatgcc 780 
ggctcacccg tgaaaatgga ggaagaaatt 840 
atcaaagcca aagcccaatg gccagcctgg 900 
gctgaatttt ccgtggacag aacgcgccat 960 
agtcccgact ggaacgtagg cttatctgca 1020 
gtccagaagg tggtgcaaga cctgattccc 1080 
tatgagtcac ccaacaaacc caccattccc 1140 
gaccatcctc agagtccttt ctatgaccca 1200 
gttgtcatcg agagaatcgc acggaagggt 1260 
gatgatattg tagctgacct ggctccagaa 1320 
tgcatctact ccaactggtc cccatggtcc 1380 
aagaggatgc gacagcgcat gctgaaagca 1440 
acccaggact tccagccctg catgggccct 1500 
accatgtccg agtggatcac ctggtcgccc 1560 
tcccgggaga ggtatgtgaa gcagttcccg 1620 
gaggaaatgg agaagtgcac ggtcaacgag 1680 
gagtggggcg agtgggacga gtgcagcgcc 1740 
cgcatgatca agatgaaccc cgcagatggc 1800 
gagaagtgca tgatgccaga gtgccacacc 1860 
tggagtgact gcagcgtgac ctgcgggaag 1920 
tctctggcag aacttggaga ctgcaatgag 1980 
cctgaatgcc ccattgactg tgagctcacc 2040 
tcatgtggga aaggccacgt gattcgaacc 2100 
ggtgcaccct gcccagagac tgtgcagcga 2160 
aatccatcca tccaaaagcc acgctggagg 2220 
ctgaaggaag agtctgaagg ggagcagttc 2280 
tggtcagaat gcaccaaact gtgcggaggt 2340 
aagagattca aaagctccca gtttaccagc 2400 
aatgttcatc cttgttagca agggtacgag 2460 
tcaccaatgg ctggattatt tgcttgttta 2520 
atttttgcag tgtggttcgc ccagtagtct 2580 
cttcttgatg ggtacaggct gagtggggcg 2640 
aggagtagtg tcagccacct tgtactaagc 2700 
gccagggagg acggagactt tgacctactc 2760 
actatgcctg agtcccaggg tgcggcaggt 2820 
tccccacatt ctcatctttg gcctgttcaa 2880 
tggaacttta ggtctctttt caagtctcct 2940 
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cagtcatcaa tagttcctgg ggaaaaacag 
ttgggtggct tttgttcttt cactgagaaa 
tggttcctga tgccccagc 

<210> 185 

<211> 3000 

<212> DNA 

<213> Homo sapiens 

<400> 185 

gtttcagggg aggagacaag gtttcttgtt 
gtgaccgtgg aggccatctg gccctgtgtt 
gaagaccttt gagcaagaaa gtaccctgga 
gggtcttcca ggtgcctctc acagcagcag 
tgaccaactg ctgcataaca aatagccccg 
cccacagact ggcactggtc catggcttgt 
tgagcattac tgcctgagct ctgcctcctg 
ggagcgtgaa ccctattgca aaccgcgcat 
agaatctaac taatgcctga tgatttgagg 
tcccttcatg tccatggaaa aattgtcttc 
tggagactgc tggtttacaa ccgcaatgaa 
cgggaacacg ggtgccctgc ctgtgtgctt 
caaggtatca gcggaggccg tattcatctg 
catgggtggc tcactcagat gccagcaggt 
tcctcacctc atggatctct cctgagcaca 
cttctccaga gcaggtgact caggagagga 
agtctcagaa gtcacacgcc atcatttctg 
tatcactgtg ggaggtgagt atatagatgt 
accatgaact gagtggctca tgaacaacag 
tccaagatca aggtggcagc aggttcagcg 
atcttctcac tgtgtcctca cgtgatggac 
agccatgagg actctgcttt catgactcat 
acagctgcta actgcagctg ccatcctcca 
acattgagat ctgaaaggcc tggacagcaa 
gtggctgccg cagtaacatt tctgacccat 
gatcttcatt atctcaggca gtccccacaa 
attttacaga tgaaacgtct cagagggtaa 
gtcactgagg taggatttca acctaggtcc 
ttgccataga cttaactgtg tcccccaaaa 
cccccaatgt gctgatgttt ggagaaaggg 
tcatgagggt gggactctca taatggcatt 
cttgtgtcct ctctctctgc aatgtgagga 
ggaagagagt actgaccagg aacttaatca 
tccagaactc tgaaaagtta atgtctatta 
agagccaacc caagcttact aagataatca 
tacatttata aaaacaaaac aagagatttg 
gatggaataa caaatctaca tacagattta 
tatctaatag agacatagtg tctgtggtct 
attgggcctt gtccaggtat tcttctccct 
tctttgctaa cccataacca actattgtat 
gtttgggttt actgtttctc accatttaga 
ctttagtgat atcagttttt gattcattcc 
ggaggtgtat tcaccatcca gttctctgga 
ttgtcccaaa aggagccttc ctaaaatagt 
attaagagaa aaaaaacccc aaactcattc 
cactgcagtg gcctgagttt tgctgttaat 
tctgcctaga gctggaatgt cattatgtaa 
gtaccagtaa tgtatatctt attcagcttc 



agctggtaga cttgaagagg agcattgatg 3000 
ttcggaatac atttgtctca cccctgatat 3060 

3079 



tgccgtatat gctcctgcag agaagaggaa 60 
ttgatatggc aaaattaatg aatgcaatca 120 
acaacccaat ttggactgca agtattagtt 180 
tcatggcagc agtgactcta gccatgtcca 240 
agactcagca gcttacaaca gggtccccag 300 
taggaacctg actgcgcagc agaaggtgag 360 
tcagatcatc aggggcatta gattctcata 420 
gcgaaggatg tacgttgcgt gctccttatg 480 
tggggcagtt tcatccccaa accatctctc 540 
tacaaaacca gtccgtggtg ccaaaaaggt 600 
cattcatcat cccacacagt gtcagagggt 660 
ccggttccag atttctcagt gggttgtgat 720 
caagcttgac caggaataga agagccactt 780 
cagtgctggt ggctggcagg cagcctcagc 840 
gttttcctgt ccttacaacc tggtagctgg 900 
caaggtgaga gccacagcac cttatggtct 960 
caatgtcatt ttggggttcc aggtcagctg 1020 
cctagaccat tcaggctgct atgacagaac 1080 
aaatttccca cagttctgta ggctgggaaa 1140 
tctgctaagc tcctgctttt catggattgc 1200 
agagcaaatg agctctcagg cactagtccc 12 60 
cactccgcaa aggcccacct ccatcagaag 1320 
agacgggaga cacagaattg ggggacatat 1380 
caggtgggga tcgtgggggc atcttggagg 1440 
gctttctgct tgcactcatc tcctgccttt 1500 
cgactgtatc taggagttca ttttaccctc 1560 
tgtgcttgcc cagtgtctca caaatgcaaa 1620 
aatcatctct gcagcattag gggttcacca 1680 
tttgtatgtt gaagccctac cagcctcccc 1740 
cctttgggag gtaattaggt ttagatgaga 1800 
aatgccatca ggtgaagaga taccagagac 1860 
cacagtgaga aggcagctgt ctgcaagctg 1920 
gagggcatct tgatcttgga cttcccagcc 1980 
tttaagccac gcagtctatg gaattttgtt 2040 
gtatgctgca ctttctataa atgtaatttt 2100 
ctgctctata acaactgtac ctacattgta 2160 
gtaatctcta tgtagatata gaacatagtg 2220 
gatgttaatt ttaggaatta gccgtcactg 2280 
tgtcctggct ctgtaaccta gttatccttg 2340 
caggactatt atgccactac agatgatgca 2400 
caatacttca tcaaatatat ttctgtatga 24 60 
tgcatagatc tgggcaaatt gtagacctta 2520 
actgcttatg acatttttct ctgagctttc 2580 
ctttaagtgc ctttaaaaag agaaagagaa 2640 
ctttactctg atgtgacagt cctcccagga 2700 
ttcattcact tatgtttggg ctatgtaaat 2760 
agaaatattt tttgtttata ttctttaata 2820 
gagaatataa ttgggttgtt tataaaaacc 2880 
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acacatcatc aaactcacat tgtaacgatt atttcacttt tcaaaaaaaa tggcattaga 2940 
aaaacttgaa tgatgttagt tatcttaaag aagtgtgtac tatgtttaaa aaaaaaaaaa 3000 



<210> 186 
<211> 807 
<212> PRT 
<213> Homo sapiens 



<400> 186 
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He 
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Asp 
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Thr 
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Asp Lys 
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He 
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Cys 


VjX jr 
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xXxX 
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^ u u 














Leu 


Thr 
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Phe 


Tyr 


Gly 


Asn 


Trp 

^ X «J 
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Glu 


Lys 


Thr 


His 


Pro Lys Asp 


Tyr 


Pro Arg 


Arg Ala Asn 


His 


Trp 


Ser 


Ala 


He 


He 


Gly 


Gly Ser His 


Ser 


225 










230 
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Lys 


Asn 


Tyr Val 


Leu 


Trp 


Glu 


Tyr 


Gly 


Gly 


Tyr 


Ala 


Ser Glu Gly 


Val 










245 










250 






255 




Lys 


Gin 


Val 


Ala 
260 


Glu 


Leu 


Gly 


Ser 


Pro 
265 


Val 


Lys 


Met 


Glu Glu Glu 
270 


He 


Arg Gin 


Gin 


Ser Asp 


Glu 


Val 


Leu 


Thr 


Val 


He 


Lys 


Ala Lys Ala 


Gin 
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Trp 


Pro 


Ala 


Trp Gin 


Pro 


Leu 


Asn 


Val 


Arg 


Ala 


Ala 


Pro Ser Ala 


Glu 
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295. 
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Phe 


Ser 


Val Asp Arg 


Thr 


Arg 


His 


Leu 


Met 


Ser 


Phe 


Leu Thr Met 


Met 


305 
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315 
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Gly 


Pro 


Ser 


Pro Asp 


Trp 


Asn 


Val 


Gly 


Leu 


Ser 
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Glu Asp Leu 


Cys 










325 










330 






335 




Thr 


Lys 


Glu 


Cys 
340 


Gly 


Trp 


Val 


Gin 


Lys 
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Val 
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Gin 


Asp Leu He 
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Pro 


Trp Asp 
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Gly Thr 


Asp 


Ser 
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Val 


Thr 
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Glu 
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Lys 
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Pro 


Thr 
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He 


Pro 


Gin 


Glu 


Lys 
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He 
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Pro 


Leu 


Thr 
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Ser Leu Asp 


His 
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Pro Gin 


Ser 


Pro Phe 


Tvr 


Asp 


Pro 
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Ala Arg 
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Val lie 


Glu 
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lie 
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lie Val 
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585 






Gin 


Ala Glu Lvs Cvs M^t 


Met! Pro 

cue Im £ X U 




QUO 




JJCU 


QttT" Pt~o Tm^ Qor" 121 in 
OCSX ITX U -i-X^ OCX uxu 


xxp oer 








Glv 


Met Aro Thi* Aro f3ln 

Ue AX ^ X IIX r^X WXII 


AX %J L ic? U 








Asp 


A^n fil n AQn T<<aii 

'^yj /roll \3X U fl9j^ JJCU 


Gill Gin 






6S\^ 

DOS 


Cys 


PT*rt Tl P Acr> r^xfG Olii 
irx^J xxc Ao^ v^^o l^J-U. 


Jjcu Xiix 




0 / U 




Asn 


T.\7Q C*\7<Z f^l \r T.\7<3 


Gl \/ Ht « 




6RS 

Q O o 




Met 


Gill Pt*q dn PViP Glv 

wXU cXvj %3Xil JtllC V^Xy 


Glv Ala 
oxjr AX a 








Lys 


TiVQ f^v^ Atti Tie* Ai^rr 


xjyo \*yo 






ion 


Pro 


Arg Trp Arg Glu Ala 


Arg Glu 




730 


735 


Glu 


Glu Ser Glu Gly Glu 


Gin Phe 


745 


750 




Thr 


Ala Trp Ser Glu Cys 


Thr Lys 




765 




Arg 


Tyr Met Thr Val Lys 


Lys Arg 




780 




Cys 


Lys T^p Lys Lys Glu 


He Arg 




795 


800 



<210> 187 
<211> 892 
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<212> DNA 

<213> Homo sapiens 



<400> 187 

tttattgatg tttcaacagg cacttattca 
taagcatcct tggcttctca cccattcctc 
ggtaccctga atagcactaa gtgctctgta 
cccacaggca gagaaagaac tcctcaagtg 
gcacacctat gatgaaggtc ttcacctcca 
cctggagagt gggagaaaat ccctcctttg 
cctcattgct tagtgatcat ttggaaggca 
aatgaaaatg agtacagttc agatggtgga 
tttttctcag tgtctgcaac tccaaagatc 
agattagttc tttgtcaagt gaatgaaatc 
ccgcctgttg taaaatctat tttcccccaa 
agttctaaaa gatacccttg gaattatcag 
acagtctcac cagacttctc ataatgctct 
gagtttatga gggtaaagct ctactttcct 
atctattttg tagccaataa aactccgcac 

<210> 188 
<211> 1448 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> 1124 

<223> n « A,T,C or G 



aataagttat atatttgaaa acagccatgg 60 
atgtggcatg ctttctagac tttaaaatga 120 
agctcaagga atctgtgcag tgctacaaag 180 
cttgtggtca gagactaggt tccatatgag 240 
gaaggtgaca ctgttcagag atcctcattt 300 
ggaaatccct tttcccagca gcagagccca 360 
ctgagagcct tcaggggctg acagcagaga 420 
agaagcatgg cagtgacatc ttccatgctc 480 
aaggccataa cccaggagac catcaacgga 540 
caaaagcacg catgagacca atgaaagttt 600 
ggaaagtcct tgcacagaca ccagtgagtg 660 
actcagaaac ttttattttt tttttctgta 720 
taatatattg cacttttcta atcaaagtgc 780 
actgcagcct tcagattctc atcattttgc 840 
tagcaaaaaa aaaaaaaaaa aa 892 



<400> 188 

tgtgactcac atttctttta ctgtgacaca 
gagtgtttac aactcaaaca actttttgaa 
ttttatgtca aatttttttt cttagaagta 
aggccatggg gaactttgtg caagtacctc 
taatttcaga aaatttcctc atatacgtag 
tttttcttca taatatttac tcaaagttaa 
atttgaagac agtaagacag taaactattt 
cagttattct ggtaaaaata ggcaaaagtg 
ccaagtaaga gagattgtaa atgataaacc 
aaggaagctg ggcacatgtc aaaaagggag 
acagaaaata tttaaagtgg ctcataggta 
tctggaatct ttacatcctt tgccagctga 
atggtcagtt tattttgtaa gagacagaag 
cagtacctgg aagccccagc ccgtcacaga 
ttccctagcc cttgtgattt gaagcatgaa 
ataaagtatg gttttatttt ggtgtaattt 
aaaaggcatt attggaaata tttgaaaact 
aaatttctga ctgtcagtag aagtgagata 
aatcattaag atagtaaaac aggcaaagca 
taaacattgg aatgcataag ttttaatatt 
tactaactgc cctctcccaa gattcattta 
tgccacatgc attcaatggg atcttttaag 
tactgaaatc aaattaattt gtattcaatg 
gaaagcctac aatgaatcat tgttcamcct 
aaaaaaaa 



ataatgtgat cctaaaactg gcttatcctt 60 
tgcagtagtt tttttttttt aaaaacaaac 120 
gtcttcatta ttataaattt gtacaccaaa 180 
atcgctgagc aaatggagct tgctatgttt. 240 
tgtgtagaat caagtctttt aataattcat 300 
gcttaaaaat aagttttatc ttaaaatcat 360 
taggaagtca acccccattg cactctgtgg 420 
acctgaatct acaatggtgt cccaaagtaa 480 
gagctttaaa ggataaagtg ttaataaaga 540 
atcgaaatgt taggtaatca tttagaaagg 600 
atgaatattt ctgacttaga tgtaaatcca 660 
aacaagaaag tgaagggaca atgatatttc 720 
aaattatatc tatacattac cttgtagcag 780 
agtgtggagg ggggctcctg actagacaat 840 
agttctggca ggttatgagc agcactaggg 900 
aggtttttca acaaagccct tgtctaaaat 960 
agaaaatgat ggataaaagg gctgataaga 1020 
agatcctcag aggaaacagt aagaagggat 1080 
gaatcacatg tgcncacaca catacacatg 1140 
ttagcgctat cagtttctaa atgcattaat 1200 
gttcaaacag tatccgtaaa ctaggaataa 1260 
tactcttcag tttgttccaa gaaatgtgcc 1320 
tgtacttcaa gactgctaat tgtttcatct 1380 
tgaaaaataa aattttgtaa atcaaaaaaa 1440 

1448 



<210> 189 
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<211> 460 
<212> DNA 

<213> Homo sapiens 
<400> 189 

ttttgggagc acggactgtc agttctctgg 
cctcctctgt cttttggaga accagggctc 
tttcagccag gaaggccaaa atcaagagtg 
ggagttggtg aatcggttgt tctttcctca 
atgttcctcc ttttcttcac cctccccttt 
gttaatggga tggtcggatc tcacaggctg 
aaaagctgct tcttattaat catacaaact 
cttcaaaata aaaagtaatg acttaaaaaa 

<210> 190 
<211> 481 
<212> DNA 

<213> Homo sapiens 



gaagtggtca gcgcatcctg cagggcttct 60 
ttctcagggg ctctagggac tgccaggctg 120 
agatgtagaa agttgtaaaa tagaaaaagt 180 
catttggatg attgtcataa ggtttttagc 240 
tttcttctat taatcaagag aaacttcaaa 300 
agaactcgtt cacctccaag catttcatga 360 
ctcaccatga tgtgaagagt ttcacaaatc 420 
aaaaaaaaaa 460 



<400> 190 

aggtggtgga agaaactgtg gcacgaggtg actgaggtat ctgtgggagc taatcctgtc 60 
caggtggaag taggagaatt tgatgatggt gcagaggaaa ccgaagagga ggtggtggcg 120 
gaaaatccct gccagaacca ccactgcaaa cacggcaagg tgtgcgagct ggatgagaac 180 
aacaccccca tgtgcgtgtg ccaggacccc accagctgcc cagcccccat tggcgagttt 240 
gagaaggtgt gcagcaatga caacaagacc ttcgactctt cctgccactt ctttgccaca 300 
aagtgcaccc tggagggcac caagaagggc cacaagctcc acctggacta catcgggcct 360 
tgcaaataca tccccccttg cctggactct gagctgaccg aattccccct gcgcatgcgg 420 
gactggctca agaacgtcct ggtcaccctg tatgagaggg atgaggacaa caaccttctg 480 
a 481 



<210> 


191 


<211> 


489 


<212> 


DNA 


<213> 


Homo I 


<220> 




<221> 


misc : 


<222> 


312, ^ 


<223> 


n = Aj 



455 



<400> 191 

atataaatta gactaagtgt tttcaaataa atctaaatct tcagcatgat gtgttgtgta 60 
taattggagt agatattaat taagtcccct gtataatgtt ttgtaatttt gcaaaacata 120 
tcttgagttg tttaaacagt caaaatgttt gatattttat accagcttat gagctcaaag 180 
tactacagca aagcctagcc tgcatatcat tcacccaaaa caaagtaata gcgcctcttt 24 0 
tattattttg actgaatgtt ttatggaatt gaaagaaaca tacgttcttt tcaagacttc 300 
ctcatgaatc tntcaattat aggaaaagtt attgtgataa aataggaaca gctgaaagat 360 
tgattaatga actattgtta attcttccta ttttaatgaa tgacattgaa ctgaattttt 420 
tgtctgttaa atgaacttga tagctaataa aaagncaact agccatcaaa aaaaaaaaaa 480 
aaaaaaaaa 489 

<210> 192 
<211> 516 
<212> DNA 

<213> Homo sapiens 
<400> 192 

acttcaaagc cagctgaagg aaagaggaag tgctagagag agcccccttc agtgtgcttc 60 
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tgacttttac ggacttggct tgttagaagg 
gcttgtatgc atgctactcc tggctttcag 
ggaaatgaaa gcattagaag cagatttctt 
agcacatgtt ccctcttgga agatgactct 
gaacagccca gctgaggaaa caggagaagt 
cttcttactg ctttagatgg ctttagcttg 
aaaatctgtc acagcagggc ttttcaacac 
actggaaatg acaaaaatgg aaaggaagaa 

<210> 193 

<211> 1409 

<212> DNA 

<213> Homo sapiens 

<400> 193 

tgattctttt ccaaaacttt tagccatagg 
aattgagaaa tataagatga aaaggaatgg 
gtgatctgaa atcttgggag aagctgttct 
ctgcgcactg tgtaccccga gcaacattct 
ttgacctcat tcttcatata gtagtctagg 
tggtacatag taactgaatt tctattccta 
acacatttta tactttgcat ctccaaattt 
tagagaacat tatgtttgta tcatttcttt 
tttcatcaga cccagtgaaa actaaggata 
ggagaacaca atccaccatt gtcatttaag 
tcttgttaaa tagatttaac aggaacatct 
ttaattgcag tggattaata cagcaagagt 
ttcaagactt agttattgtc acactaattg 
aggaacatgt gaagctaatc tgctcaaaaa 
tgcataattg gctgcaatta tttaatgttt 
tcacaagtgc attaatataa acagaactgg 
ttgcatgttc tcttcctttc acttttttca 
ttttttgaaa acacatcagt agaaaccaag 
aaacctgagc agttcctaaa aagatttgct 
ttaaggattc aactcgtgat acaccaaaag 
tctctagcac aggaatgaat aaatttataa 
caaaggaaaa gtgaaaggga aagagaaaca 
gttttttcta cctggtaaac attctctatt 
tgtaagmcaa aaaaaaaaaa aaaaaaaaa 



ctgaaagatg atggcaggaa tgaaaatcca 120 
ctcctggagt ctgtgctcag attcagaaga 180 
gaccaatatg catacatcaa agattagtaa 240 
gctaaatgtt tgcagtcttg taaataattt 300 
tcatgaagag gagcttgttg caagaaggaa 360 
gaagcaatgt tgacaatata ccagctccac 420 
tgggagttaa tccaggaaga tattcttgat 480 
gtcata 516 



gtcttttata gacagggata gtaaaatgaa 60 
taaaaatatc ttttaggggg cttttaattg 120 
tttcaggcct gaggtgctct tgactgtcgc 180 
aagggtgtgc tttcgccttg gctaactcct 240 
aaaaagttgc aggtaattta aactgtctag 300 
tgagaaatga gaattattta tttgccatca 360 
attgcggcga gacttgtcca ttgtgaaagt 420 
cataaaacct caagagcatt tttaagccct 4 80 
gatgtttttt aactggaggt ctcctgataa 540 
taataagaca ggaaattgac cttgacgctt 600 
gcacatcttt tttccttgtg cactatttgt 660 
gccacattat aactaggcaa ttatccattc 720 
atcgtttaag gcataagatg gtctagcatt 780 
gatcaacaaa ttaatattgt tgctgatatt 840 
aattgggttg atcaaatgag attcagcaat 900 
ggcacttaaa atgataatga ttaacttata 960 
gtgtctacat ttcagaccga gtttgtcagc 1020 
attttaaaat gaagtgtcaa gacgaaggca 1080 
gttagaaatt ttctttgtgg cagtcattta 1140 
aagagttgac ttcagagatg tgttccatgc 1200 
cacctgcttt agcctttgtt ttcaaaagca 1260 
agtgactgag aagtcttgtt aaggaatcag 1320 
cttttctcaa aagattgttg taagaaaaaa 1380 

1409 



<210> 194 
<211> 441 
<212> DNA 
<213> Homo sapiens 

<400> 194 

cagatttcgg tagccatctc cctccaaata 

tttccccttc tcctttcttc tgtcactgcc 

aactggccgt aatgtggaat tgatatttac 

tacgggattg cctcatttcc tgctctgaat 

gtttcctcac aaaagtcaac aaagtccaaa 

attgaaaaag gaaattgtgc ctcttgcagc 

ctttccaccc tcacgatgac ttgcggttct 

caactaaaaa aaaaaaaaaa a 



tgtctctttc tgctttctta gtgcccatta 60 
atctccttct tggtcttccc attgttcttt 120 
attttgatac ggtttttttc ttggcctgtg 180 
tttaaaatta gatattaaag ctgtcatatg 24 0 
caaaaatagt ttgccgtttt actttcatcc 300 
ctaggcaaag gacatttagt actatcgatt 360 
ctctgtagaa aagggatggc ctaagaaata 420 

441 



<210> 195 
<211> 707 
<212> DNA 
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<213> Homo sapiens 
<400> 195 

cagaaaaata tttggaaaaa atataccact tcatagctaa gtcttacaga gaagaggatt 60 

tgctaataaa acttaagttt tgaaaattaa gatgcaggta gagcttctga actaatgccc 120 

acagctccaa ggaagacatg tcctatttag ttattcaaat acaagttgag ggcattgtga 180 

ttaagcaaac aatatatttg ttagaacttt gtttttaaat tactgttcct tgacattact 240 

tataaagagt ctctaacttt cgatttctaa aactatgtaa tacaaaagta tagtttcccc 300 

atttgataaa aggccaatga tactgagtag gatatatgcg tatcatgcta cttcattcag 360 

tgtgtctgtt tttaatacta ataaggcagt ttgacagaaa ttatttcttt gggactaagg 420 

tgattatcat ttttttcccc ttcaaaattg tgctttaagt gctgataacc acaggcagat 480 

tgcaaagaac tgataaggca acaaaagtag agaattttag gatcaaaggc atgtaactga 540 

aaggtaacaa cagtacataa gcgacaactg gggaaggcag cagtgaaaca tgtttgtggg 600 

gttaagtgag tcattgtaaa taaggaattt gcacatttat tttctgtcga cgcggccgcc 660 

actgtgctgg atatctgcag aattccacca cactggacta gtggatc 707 

<210> 196 
<211> 552 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 61, 129, 189, 222, 241, 278, 324, 338, 363, 408, 415, 463, 

483 

<223> n « A,T,C or G 
<400> 196 

tggccagcca gcctgatgtg gatggcttcc ttggggtggt gcttccctca agcccgaatt 60 
ngtggacatc atcaatgcca aacaatgagc cccatccatt ttccctaccc ttcctgccaa 120 
gccagggant aagcagccca gaagcccagt aactgccctt tccctgcata tgcttttgat 180 
ggtgtcatnt gctccttcct gtggcctcat ccaaactgta tnttccttta ctgtttatat 240 
nttcaccctg taatggttgg gaccaggcca atcccttntc cacttactat aatggttgga 300 
actaaacgtc accaaggtgg cttntccttg gctgaganat ggaaggcgtg gtgggatttg 360 
ctnctgggtt ccctaggccc tagtgagggc agaagagaaa ccatcctntc ccttnttaca 420 
ccgtgaggcc aagatcccct cagaaggcag gagtgctgcc ctntcccatg gtgcccgtgc 4 80 
ctntgtgctg tgtatgtgaa ccacccatgt gagggaataa acctggcact aggaaaaaaa 540 
aaaaaaaaaa aa 552 

<210> 197 
<211> 449 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 56, 58, 76 
<223> n = A,T,C or G 

<400> 197 

ctccagagac aacttcgcgg tgtggtgaac tctctgagga aaaacacgtg cgtggnanca 
agtgactgag acctanaaat ccaagcgttg gaggtcctga ggccagccta agtcgcttca 
aaatggaacg aaggcgtttg cggggttcca ttcagagccg atacatcagc atgagtgtgt 
ggacaagccc acggaga6tt gtggagctgg cagggcagag cctgctgaag gatgaggccc 
tggccattgc ccgccctgga gttgctgccc agggagctct tcccgccact cttcatggca 
gcctttgacg ggagacacag ccagaccctg aaggcaatgg tgcaggcctg gcccttcacc 
tgcctccctc tgggagtgct gatgaaggga caacatcttc acctggagac cttcaaagct 
gtgcttgatg gacttgatgt gctccttgc 



60 

120 

180 

240 

300 

360 

420 

449 
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<210> 198 
<211> 606 
<212> DNA 
<213> Homo sapiens 



<400> 198 

tgagtttgcc 

attgtcacac 

tgctaaacat 

atgtccataa 

agcttgcctt 

ctcaacgtcc 

ttgatttcaa 

tcctagtttt 

tcaccctagc 

tatactttaa 

ctgcac 



cccttacccc 
ctgggtgggg 
ctttcaacgc 
cactgctgtt 
tgttctggct 
cgagccaggg 
aatctcagca 
agcattgaaa 
agctgaggga 
tttgtaacat 



catcccagtg 
aacatgctac 
acaggacaga 
gagaaaacct 
tctgtagata 
ctcaaggcaa 
actagaagaa 
gtttcaggtt 
ctcttcaata 
gtgaaaacat 



aatatttgca 
tggcatctaa 
gccccacaaa 
accgcaggat 
tataaaataa 
ttccaataac 
tgaccaacca 
ccaggaaagc 
cagaattagt 
ctataaacat 



attcctaaag 
tgcatagagg 
agagaattat 
cttactgggc 
agacactgcc 
agtagaatga 
tcctggttgg 
cctcaggcct 
ctttgtgcac 
ctactgaagc 



acgtgttttg 60 
gcagtaatgc 120 
ctagccccaa 180 
ttcataggta 240 
cagtccctcc 300 
acactaaata 360 
cctgggactg 420 
gggctgctgg 480 
tggagatgaa 540 
ctgttcttgt 600 
606 



<210> 199 
<211> 369 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 29, 345 

<223> n = A, T,C or G 



<400> 199 

ggcaactttt 

gtgtgaagaa 

ggtgaacgtt 

ccgcaagtcc 

ctgctcccca 

cgggccaagg 

caccatcct 



tgcggattgt 
ttccagctga 
caagacatgt 
tgtgcatcat 
gggaaactga 
cccaagaaaa 



tcttgcttnc 
acaacgactg 
gtcagaaaga 
cagcggcctg 
actcagtttg 
ggggaagttc 



aggctttgcg 
ctcctccccc 
agtgatggag 
tctcatcgcc 
catcagctgc 
tgcctcggcc 



ctgcaaatcc 
gagttcattg 
caaagtgccg 
tctgccgggt 
tgcaacaccc 
ctcangccat 



agtgctacca 60 
tgaattgcac 120 
ggatcatgta 180 
accagtcctt 240 
ctctttgtaa 300 
ggctccgcac 360 
369 



<210> 200 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<400> 200 

Met Tyr Arg Asn Trp Ser Gly Cys Phe Gly Leu Gin Val Thr Leu Cys 

1 5 10 15 

His Thr Phe Glu Thr Arg Asp Leu Ser Arg Leu Ser Ser Asp Ser Gin 

20 25 30 

Pro Thr Ser Asn Val Ser Gin Ser lie Ser His Lys Val Leu Ser Phe 

35 40 45 

Ser Gly Val lie Val Thr Pro 
50 55 



<210>'201 

<211> 67 

<212> PRT 

<213> Homo sapiens 
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<400> 201 












Met Gin Leu 


Leu 


Ser 


Pro 


Asn 


Thr 


1 




5 








Gin Arg Gly 


Asn 
20 


Leu 


Val 


Phe 


Leu 


Leu Lys Asn 


Phe 


Gin 


Glu 


Leu 


lie 


35 










40 


Arg Val Asp 


Val 


Trp Trp 


Tyr Cys 


50 








55 




Pro Asn Asn 













Lys Phe Thr Ser Cys Leu Ser Arg 

10 15 
Gly Asp Leu Lys Gly Cys Ser Glu 
25 30 
Asn Gin Ser Ala Leu Val His Pro 
45 

Gly Gly Pro Leu Leu Gly Thr Leu 
60 



<210> 202 

<211> 73 

<212> PRT 

<213> Horao sapiens 

<400> 202 

Met Thr Pro Glu Lys Leu Arg Thr 

1 5 
Leu Glu Val Gly Trp Leu Ser Glu 
20 

Ser Lys Val Trp His Lys Val Thr 

35 40 
Phe Leu Tyr lie Asp Ser Tyr Ser 

50 55 
Pro Thr Val Val Lys Arg Thr Ala 
65 70- 



Leu 


Cys Glu lie 


Asp 


Trp 


Leu 


Thr 




10 






15 




Glu 


Ser Leu Glu 


Arg 


Ser 


Leu 


Val 


25 






30 






Cys 


Lys Pro Lys 


His 


Pro 


Asp 


Gin 






45 








Trp 


Phe Arg Pro 


Leu 


Pro 


Pro 


Leu 




60 










Ala 













<210> 203 

<211> 2008 

<212> DNA 

<213> Homo sapiens 

<400> 203 

ctccagagac aacttcgcgg tgtggtgaac tctctgagga aaaacacgtg cgtggtaaca 60 
agtgactgag acctagaaat ccaagcgttg gaggtcctga ggccagccta agtcgcttca 120 
aaatggaacg aaggcgtttg cggggttcca ttcagagccg atacatcagc atgagtgtgt 180 
ggacaagccc acggagactt gtggagctgg cagggcagag cctgctgaag gatgaggccc 240 
tggccattgc ccgccctgga gttgctgccc agggagctct tcccgccact cttcatggca 300 
gcctttgacg ggagacacag ccagaccctg aaggcaatgg tgcaggcctg gcccttcacc 360 
tgcctccctc tgggagtgct gatgaaggga caacatcttc acctggagac cttcaaagct 420 
gtgcttgatg gacttgatgt gctccttgcc caggaggttc gccccaggag gtggaaactt 480. 
caagtgctgg atttacggaa gaactctcat caggacttct ggactgtatg gtctggaaac 540 
agggccagtc tgtactcatt tccagagcca gaagcagctc agcccatgac aaagaagcga 600 
aaagtagatg gtttgagcac agaggcagag cagcccttca ttccagtaga ggtgctcgta 660 
gacctgttcc tcaaggaagg tgcctgtgat gaattgttct cctacctcat tgagaaagtg 720 
aagcgaaaga aaaatgtact acgcctgtgc tgtaagaagc tgaagatttt tgcaatgccc 780 
atgcaggata tcaagatgat cctgaaaatg gtgcagctgg actctattga agatttggaa 840 
gtgacttgta cctggaagct acccaccttg gcgaaatttt ctccttacct gggccagatg 900 
attaatctgc gtagactcct cctctcccac atccatgcat cttcctacat ttccccggag 960 
aaggaagagc agtatatcgc ccagttcacc tctcagttcc tcagtctgca gtgcctgcag 1020 
gctctctatg tggactcttt atttttcctt agaggccgcc tggatcagtt gctcaggcac 1080 
gtgatgaacc ccttggaaac cctctcaata actaactgcc ggctttcgga aggggatgtg 1140 
atgcatctgt cccagagtcc cagcgtcagt cagctaagtg tcctgagtct aagtggggtc 1200 
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atgctgaccg atgtaagtcc cgagcccctc caagctctgc tggagagagc ctctgccacc 1260 

ctccaggacc tggtctttga tgagtgtggg atcacggatg atcagctcct tgccctcctg 1320 

ccttccctga gccactgctc ccagcttaca accttaagct tctacgggaa ttccatctcc 1380 

atatctgcct tgcagagtct cctgcagcac ctcatcgggc tgagcaatct gacccacgtg 1440 

ctgtatcctg tccccctgga gagt'tatgag gacatccatg gtaccctcca cctggagagg 1500 

cttgcctatc tgcatgccag gctcagggag ttgctgtgtg agttggggcg gcccagcatg 1560 

gtctggctta gtgccaaccc ctgtcctcac tgtggggaca gaaccttcta tgacccggag 1620 

cccatcctgt gcccctgttt catgcctaac tagctgggtg cacatatcaa atgcttcatt 1680 

ctgcatactt ggacactaaa gccaggatgt gcatgcatct tgaagcaaca aagcagccac 1740 

agtttcagac aaatgttcag tgtgagtgag gaaaacatgt tcagtgagga aaaaacattc 1800 

agacaaatgt tcagtgagga aaaaaagggg aagttgggga taggcagatg ttgacttgag 1860 

gagttaatgt gatctttggg gagatacatc ttatagagtt agaaatagaa tctgaatttc 1920 

taaagggaga ttctggcttg ggaagtacat gtaggagtta atccctgtgt agactgttgt 1980 

aaagaaactg ttgaaaaaaa aaaaaaaa 2008 

<210> 204 
<211> 923 
<212> DNA 
<213> Homo sapiens 



<400> 204 

tgagtttgcc cccttacccc catcccagtg aatatttgca attcctaaag acgtgttttg 60 
attgtcacac ctgggtgggg aacatgctac tggcatctaa tgcatagagg gcagtaatgc 120 
tgctaaacat ctttcaacgc acaggacaga gccccacaaa agagaattat ctagccccaa 180 
atgtccataa cactgctgtt gagaaaacct accgcaggat cttactgggc ttcataggta 240 
agcttgcctt tgttctggct tctgtagata tataaaataa agacactgcc cagtccctcc 300 
ctcaacgtcc cgagccaggg ctcaaggcaa ttccaataac agtagaatga acactaaata 360 
ttgatttcaa aatctcagca actagaagaa tgaccaacca tcctggttgg cctgggactg 420 
tcctagtttt agcattgaaa gtttcaggtt ccaggaaagc cctcaggcct gggctgctgg 480 
tcaccctagc agctgaggga ctcttcaata cagaattagt ctttgtgcac tggagatgaa 540 
tatactttaa tttgtaacat gtgaaaacat ctataaacat ctactgaagc ctgttctgtc 600 
tgcaccgaca ttttcattga gtacggattc ttcctaccag atacagctgc tctacaactt 660 
tcgagggctg gtataaaact agcttttacc tatttttaaa aattacatga atagtaaaaa 720 
cttggattaa cccagtattc gggtattttc aatttccttg ggagcttaga ggacggacaa 780 
ataaaaagat tatttcaaca tcaaatatat gctattgttt acatatgaag ataaccacat 840 
atatgtataa attcaccgtt actttttagc aatactataa aatccaacag aaaaaaatag 900 
catttactaa aaaaaaaaaa aaa 923 

<210> 205 

<211> 1619 

<212> DNA 

<213> Homo sapiens 



<400> 205 

ggcaactttt tgcggattgt 
gtgtgaagaa ttccagctga 
ggtgaacgtt caagacatgt 
ccgcaagtcc tgtgcatcat 
ctgctcccca gggaaactga 
cgggccaagg cccaagaaaa 
caccatcctg ttcctcaaat 
ccaccccctc ctgcattgtt 
gtttcttctg ggtgtccttt 
cctgtgcaaa taatgaaaga 
tttcagtatg tacttgaagg 
cggagtcaag gccaggctgg 
ttttgtaaag cctccagtgt 
gtgagagtga cgttttctta 



tcttgcttcc aggctttgcg 
acaacgactg ctcctccccc 
gtcagaaaga agtgatggag 
cagcggcctg tctcatcgcc 
actcagtttg catcagctgc 
ggggaagttc tgcctcggcc 
tagccctctt ctcggcacac 
cttccagccc tcgcccccaa 
tattctgggt agggagcggg 
gctcggtaaa gcattctgaa 
aaggaggtgg agtgaaagtt 
cagagtcagt ccttagaagt 
ccattccatc cctgatgggg 
gggctggagg gccagttccc 



ctgcaaatcc agtgctacca 60 
gagttcattg tgaattgcac 120 
caaagtgccg ggatcatgta 180 
tctgccgggt accagtcctt 240 
tgcaacaccc ctctttgtaa 300 
ctcaggccag ggctccgcac 360 
tgctgaagct gaaggagatg 420 
ccccccacct ccctgagtga 480 
agtccgtgtt ctcttttgtt 540 
taaattcagc ctgactgaat 600 
cacccccatg tctgtgtaac 660 
cactgaggtg ggcatctgcc 720 
gcatagtttg agactgcaga 780 
actcaaggct ccctcgcttg 840 
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acattcaaac ttcatgctcc tgaaaaccat tctctgcagc agaattggct ggtttcgcgc 900 

ctgagttggg ctctagtgac tcgagactca atgactggga cttagactgg ggctcggcct 960 

cgctctgaaa agtgcttaag aaaatcttct cagttctcct tgcagaggac tggcgccggg 1020 

acgcgaagag caacgggcgc tgcacaaagc gggcgctgtc ggtggtggag tgcgcatgta 1080 

cgcgcaggcg cttctcgtgg ttggcgtgct gcagcgacag gcggcagcac agcaccttgc 1140 

acgaacaccc gccgaaactg ctgcgaggac accgtgtaca ggagcgggtt gatgaccgag 1200 

ctgaggtaga aaaacgtctc cgagaagggg aggaggatca tgtacgcccg gaagtaggac 1260 

ctcgtccagt cgtgcttggg tttggccgca gccatgatcc tccgaatctg gttgggcatc 1320 

cagcatacgg ccaatgtcac aacaatcagc cctgggcaga cacgagcagg agggagagac 1380 

agagaaaaga aaaacacagc atgagaacac agtaaatgaa taaaaccata aaatatttag 1440 

cccctctgtt ctgtgcttac tggccaggaa atggtaccaa tttttcagtg ttggacttga 1500 

cagcttcttt tgccacaagc aagagagaat ttaacactgt ttcaaacccg ggggagttgg 1560 

ctgtgttaaa gaaagaccat taaatgcttt agacagtgta aaaaaaaaaa aaaaaaaaa 1619 

<210> 206 

<211> 2364 

<212> DNA 

<213> Homo sapiens 

<400> 206 

atgcagcatc accaccatca ccacttctcc gacgagaccc tggacaaagt gcccaagtca 60 
gagggctact gtagccgtat cctgcgcgcc cagggcacgc ggcgcgaggg ctacaccgag 120 
ttcagcctcc gcgtggaggg cgaccccgac ttctacaagc cgggaaccag ctaccgcgta 180 
acactttcag ctgctcctcc ctcctacttc agaggattca cattaattgc cctcagagag 240 
aacagagagg gtgataagga agaagaccat gctgggacct tccagatcat agacgaagaa 300 
gaaactcagt ttatgagcaa ttgccctgtt gcagtcactg aaagcactcc acggaggagg 360 
acccggatcc aggtgttttg gatagcacca ccagcgggaa caggctgcgt gattctgaag 420 
gccagcatcg tacaaaaacg cattatttat tttcaagatg agggctctct gaccaagaaa 480 
ctttgtgaac aagattccac atttgatggg gtgactgaca aacccatctt agactgctgt 540 
gcctgcggaa ctgccaagta cagactcaca ttttatggga attggtccga gaagacacac 600 
ccaaaggatt accctcgtcg ggccaaccac tggtctgcga tcatcggagg atcccactcc 660 
aagaattatg tactgtggga atatggagga tatgccagcg aaggcgtcaa acaagttgca 720 
gaattgggct cacccgtgaa aatggaggaa gaaattcgac aacagagtga tgaggtcctc 780 
accgtcatca aagccaaagc ccagtggcca gcctggcagc ctctcaacgt gagagcagca 840 
ccttcagctg aattttccgt ggacagaacg cgccatttaa tgtccttcct gaccatgatg 900 
ggccctagtc ccgactggaa cgtaggctta tctgcagaag atctgtgcac caaggaatgt 960 
ggctgggtcc agaaggtggt gcaagacctg attccctggg acgctggcac cgacagcggg 1020 
gtgacctatg agtcacccaa caaacccacc attccccagg agaaaatccg gcccctgacc 1080 
agcctggacc atcctcagag tcctttctat gacccagagg gtgggtccat cactcaagta 1140 
gccagagttg tcatcgagag aatcgcacgg aagggtgaac aatgcaatat tgtacctgac 1200 
aatgtcgatg atattgtagc tgacctggct ccagaagaga aagatgaaga tgacacccct 1260 
gaaacctgca tctactccaa ctggtcccca tggtccgcct gcagctcctc cacctgtgac 1320 
aaaggcaaga ggatgcgaca gcgcatgctg aaagcacagc tggacctcag cgtcccctgc 1380 
cctgacaccc aggacttcca gccctgcatg ggccctggct gcagtgacga agacggctcc 1440 
acctgcacca tgtccgagtg gatcacctgg tcgccctgca gcatctcctg cggcatgggc 1500 
atgaggtccc gggagaggta tgtgaagcag ttcccggagg acggctccgt gtgcacgctg 1560 
cccactgagg aaacggagaa gtgcacggtc aacgaggagt gctctcccag cagctgcctg 1620 
atgaccgagt ggggcgagtg ggacgagtgc agcgccacct gcggcatggg catgaagaag 1680 
cggcaccgca tgatcaagat gaaccccgca gatggctcca tgtgcaaagc cgagacatca 1740 
caggcagaga agtgcatgat gccagagtgc cacaccatcc catgcttgct gtccccatgg 1800 
tccgagtgga gtgactgcag cgtgacctgc gggaagggca tgcgaacccg acagcggatg 1860 
ctcaagtctc tggcagaact tggagactgc aatgaggatc tggagcaggt ggagaagtgc 1920 
atgctccctg aatgccccat tgactgtgag ctcaccgagt ggtcccagtg gtcggaatgt 1980 
aacaagtcat gtgggaaagg ccacgtgatt cgaacccgga tgatccaaat ggagcctcag 2040 
tttggaggtg caccctgccc agagactgtg cagcgaaaaa agtgccgcat ccgaaaatgc 2100 
cttcgaaatc catccatcca aaagctacgc tggagggagg cccgagagag ccggcggagt 2160 
gagcagctga aggaagagtc tgaaggggag cagttcccag gttgtaggat gcgcccatgg 2220 
acggcctggt cagaatgcac caaactgtgc ggaggtggaa ttcaggaacg ttacatgact 2280 
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gtaaagaaga gattcaaaag ctcccagttt accagctgca aagacaagaa ggagatcaga 2340 
gcatgcaatg ttcatccttg ttag 2364 

<210> 207 
<211> 787 
<212> PRT 

<213> Homo sapiens 
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<210> 208 

<211> 1362 

<212> DNA 

<213> Homo sapiens 
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<400> 208 

atggcttcac ccagcctccc gggcagtgac tgctcccaaa tcattgatca cagtcatgtc 60 

cccgagtttg aggtggccac ctggatcaaa atcaccctta ttctggtgta cctgatcatc 120 

ttcgtgatgg gccttctggg gaacagcgcc accattcggg tcacccaggt gctgcagaag 180 

aaaggatact tgcagaagga ggtgacagac cacatggtga gtttggcttg ctcggacatc 240 

ttggtgttcc tcatcggcat gcccatggag ttctacagca tcatctggaa tcccctgacc 300 

acgtccagct acaccctgtc ctgcaagctg cacactttcc tcttcgaggc ctgcagctac 360 

gctacgctgc tgcacgtgct gacactcagc tttgagcgct acatcgccat ctgtcacccc 420 

ttcaggtaca aggctgtgtc gggaccttgc caggtgaagc tgctgattgg cttcgtctgg 4 80 

gtcacctccg ccctggtggc actgcccttg ctgtttgcca tgggtactga gtaccccctg 540 

gtgaacgtgc ccagccaccg gggtctcact tgcaaccgct ccagcacccg ccaccacgag 600 

cagcccgaga cctccaatat gtccatctgt accaacctct ccagccgctg gaccgtgttc 660 

cagtccagca tcttcggcgc cttcgtggtc tacctcgtgg tcctgctctc cgtagccttc 720 

atgtgctgga acatgatgca ggtgctcatg aaaagccaga agggctcgct ggccgggggc 780 

acgcggcctc cgcagctgag gaagtccgag agcgaagaga gcaggaccgc caggaggcag 840 

accatcatct tcctgaggct gattgttgtg acattggccg tatgctggat gcccaaccag 900 

attcggagga tcatggctgc ggccaaaccc aagcacgact ggacgaggtc ctacttccgg 960 

gcgtacatga tcctcctccc cttctcggag acgtttttct acctcagctc ggtcatcaac 1020 

ccgctcctgt acacggtgtc ctcgcagcag tttcggcggg tgttcgtgca ggtgctgtgc 1080 

tgccgcctgt cgctgcagca cgccaaccac gagaagcgcc tgcgcgtaca tgcgcactcc 1140 

accaccgaca gcgcccgctt tgtgcagcgc ccgttgctct tcgcgtcccg gcgccagtcc 1200 

tctgcaagga gaactgagaa gattttctta agcacttttc agagcgaggc cgagccccag 1260 

tctaagtccc agtcattgag tctcgagtca ctagagccca actcaggcgc gaaaccagcc 1320 

aattctgctg cagagaatgg ttttcaggag catgaagttt ga 1362 

<210> 209 
<211> 453 
<212> PRT 

<213> Homo sapiens 



<400> 209 



Met 


Ala 


Ser 


Pro 


Ser Leu Pro Gly Ser 


Asp 


Cys Ser Gin lie lie Asp 


1 








5 


10 


15 


His 


Ser 


His 


Val 


Pro Glu Phe Glu Val 


Ala 


Thr Trp lie Lys lie Thr 








20 


25 




30 


Leu 


lie 


Leu 


Val 


Tyr Leu lie lie Phe 


Val 


Met Gly Leu Leu Gly Asn 






35 




40 




45 


Ser 


Ala 


Thr 


lie 


Arg Val Thr Gin Val 


Leu 


Gin Lys Lys Gly Tyr Leu 




50 






55 




60 


Gin Lys 


Glu 


Val 


Thr Asp His Met Val 


Ser 


Leu Ala Cys Ser Asp lie 


65 








70 




75 80 


Leu 


Val 


Phe 


Leu 


lie Gly Met Pro Met 


Glu 


Phe Tyr Ser He He Trp 










85 


90 


95 


Asn 


Pro 


Leu 


Thr 


Thr Ser Ser Tyr Thr 


Leu 


Ser Cys Lys Leu His Thr 








100 


105 




110 


Phe 


Leu 


Phe .Glu 


Ala Cys Ser Tyr Ala 


Thr 


Leu Leu His Val Leu Thr 






115 




120 




125 


Leu 


Ser 


Phe 


Glu 


Arg Tyr lie Ala lie 


Cys 


His Pro Phe Arg Tyr Lys 




130 






135 




140 


Ala 


Val 


Ser Gly 


Pro Cys Gin Val Lys 


Leu 


Leu He Gly Phe Val Trp 


145 








150 




155 160 


Val 


Thr 


Ser 


Ala 


Leu Val Ala Leu Pro 


Leu 


Leu Phe Ala Met Gly Thr 










165 


170 


175 


Glu 


Tyr 


Pro 


Leu 


Val Asn Val Pro Ser 


His 


Arg Gly Leu Thr Cys Asn 








180 


185 




190 


Arg 


Ser 


Ser 


Thr 


Arg His His Glu Gin 


Pro 


Glu Thr Ser Asn Met Ser 






195 




200 




205 


lie 


Cys 


Thr 


Asn 


Leu Ser Ser Arg Trp 


Thr 


Val Phe Gin Ser Ser He 
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210 










215 




Phe Gly 


Ala 


Phe 


Val 


Val 


Tvr 


Leu 


225 








230 






Met Cys 


Tro 


Asn 


Met 


Met 


Gin 


Val 








245 








Leu Ala 


Glv 


Glv 


Thr 


Arg 


Pro 


Pro 






260 










Glu Ser 


Ara 


Thr 


Ala 


Arc 


Arg 


Gin 




275 










280 


Val Val 


Thr 


Leu 


Ala 


Val 


Cys 


Tro 


290 










295 




Met Ala 


Ala 


Ala 


Lvs 


Pro 


Lys 


His 


305 








310 






Ala Tyr 


Met 


He 


Leu 


Leu 


Pro 


Phe 








325 








Ser Val 


lie 


Asn 


Pro 


Leu 


Leu 








340 








Arg Val 


Phe 


Val 


Gin 


Val 


Leu 


Cys 




355 










360 


Asn His 


Glu 


Lys 


nx ^ 




Arg 


vax 


370 










375 




Ala Arg 


Phe 


Val 


Gin 


Arg 


Pro 


Leu 


385 








390 






Ser Ala 


Arg 


Arg 


Thr 


Glu 


Lys 


He 








405 








Ala Glu 


Pro 


Gin 


Ser 


Lys 


Ser 


Gin 






420 










Fro Asn 


Ser 


Gly 


Ala 


Lys 


Pro 


Ala 




435 










440 


Gin Glu 


His 


Glu 


Val 









450 



220 



Val 


Val 


Leu 


Leu Ser Val Ala 








235 






Leu 


Met 


Lys 


Ser Gin Lys Gly 


OCX 




250 




255 




Gin 


Leu Arg Lys Ser Glu Ser 


OXU 


265 






9*7(1 




Thr 


He 

A c 




T.oii T 

ciiKi ju6u Axg jueu 


xxts 








£ Ok? 




Met 




Asn 


oi.n ixe Arg Arg 


T1 A 

j-xe 












Asp 


Trp 


Thr 


Aro Ser Tvr Phe 


Arg 






315 






Ser 


Glu 


Thr 


Phe Phe Tyr Leu 






330 




335 




Thr 


Val 


Ser 


Ser Gin Gin Phe 


Arg 


345 






350 


Cys 


Arg 


Leu 


Ser Leu Gin His 


Ala 








365 




His 


Ala 


His 


Ser Thr Thr Asp 


Ser 








380 




Leu 


Phe 


Ala 


Ser Arg Arg Gin 


Ser 






395 




400 


Phe 


Leu 


Ser 


Thr Phe Gin Ser 


Glu 




410 




415 




Ser 


Leu 


Ser 


Leu Glu Ser Leu 


Glu 


425 






430 




Asn 


Ser 


Ala 


Ala Glu Asn Gly 


Phe 



445 



<210> 210 
<211> 625 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 607 

<223> n = A,T,C or G 



<400> 210 

agttctcctt gcagaggact ggcgccggga 
ggcgctgtcg gtggtggagt gcgcatgtac 
cagcgacagg cggcagcaca gcacctgcac 
cgtgtacagg agcgggttga tgaccgagct 
gaggatcatg tacgcccgga agtaggacct 
catgatcctc cgaatctggt tgggcatcca 
tgggcagaca cgagcaggag ggagagacag 
taaatgaata aaaccataaa atatttagcc 
ggtaccaatt tttcagtgtt ggacttgaca 
aacactgttt caaacccggg ggagttggct 
acagtgnaaa aaaaaaaaaa aaaaa 



cgcgaagagc aacgggcgct gcacaaagcg 60 
gcgcaggcgc ttctcgtggt tggcgtgctg 120 
gaacacccgc cgaaactgct gcgaggacac 180 
gaggtagaaa aacgtctccg agaaggggag 240 
cgtccagtcg tgcttgggtt tggccgcagc 300 
gcatacggcc aatgtcacaa caatcagccc 360 
agaaaagaaa aacacagcat gagaacacag 420 
cctctgttct gtgcttactg gccaggaaat 480 
gcttcttttg ccacaagcaa gagagaattt 540 
gtgttaaaga aagaccatta aatgctttag 600 

625 



<210> 211 
<211> 1619 
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<212> DNA 

<213> Homo sapiens 
<400> 211 

ggcaactttt tgcggattgt tcttgcttcc aggctttgcg ctgcaaatcc agtgctacca 60 
gtgtgaagaa ttccagctga acaacgactg ctcctccccc gagttcattg tgaattgcac 120 
ggtgaacgtt caagacatgt gtcagaaaga agtgatggag caaagtgccg ggatcatgta 180 
ccgcaagtcc tgtgcatcat cagcggcctg tctcatcgcc tctgccgggt accagtcctt 240 
ctgctcccca gggaaactga actcagtttg catcagctgc tgcaacaccc ctctttgtaa 300 
cgggccaagg cccaagaaaa ggggaagttc tgcctcggcc ctcaggccag ggctccgcac 360 
caccatcctg ttcctcaaat tagccctctt ctcggcacac tgctgaagct gaaggagatg 420 
ccaccccctc ctgcattgtt cttccagccc tcgcccccaa ccccccacct ccctgagtga 480 
gtttcttctg ggtgtccttt tattctgggt agggagcggg agtccgtgtt ctcttttgtt 540 
cctgtgcaaa taatgaaaga gctcggtaaa gcattctgaa taaattcagc ctgactgaat 600 
tttcagtatg tacttgaagg aaggaggtgg agtgaaagtt cacccccatg tctgtgtaac 660 
cggagtcaag gccaggctgg cagagtcagt ccttagaagt cactgaggtg ggcatctgcc 720 
ttttgtaaag cctccagtgt ccattccatc cctgatgggg gcatagtttg agactgcaga 780 
gtgagagtga cgttttctta gggctggagg gccagttccc actcaaggct ccctcgcttg 840 
acattcaaac ttcatgctcc tgaaaaccat tctctgcagc agaattggct ggtttcgcgc 900 
ctgagttggg ctctagtgac tcgagactca atgactggga cttagactgg ggctcggcct 960 
cgctctgaaa agtgcttaag aaaatcttct cagttctcct tgcagaggac tggcgccggg 1020 
acgcgaagag caacgggcgc tgcacaaagc gggcgctgtc ggtggtggag tgcgcatgta 1080 
cgcgcaggcg cttctcgtgg ttggcgtgct gcagcgacag gcggcagcac agcaccttgc 1140 
acgaacaccc gccgaaactg ctgcgaggac accgtgtaca ggagcgggtt gatgaccgag 1200 
ctgaggtaga aaaacgtctc cgagaagggg aggaggatca tgtacgcccg gaagtaggac 1260 
ctcgtccagt cgtgcttggg tttggccgca gccatgatcc tccgaatctg gttgggcatc 1320 
cagcatacgg ccaatgtcac aacaatcagc cctgggcaga cacgagcagg agggagagac 1380 
agagaaaaga aaaacacagc atgagaacac agtaaatgaa taaaaccata aaatatttag 1440 
cccctctgtt ctgtgcttac tggccaggaa atggtaccaa tttttcagtg ttggacttga 1500 
cagcttcttt tgccacaagc aagagagaat ttaacactgt ttcaaacccg ggggagttgg 1560 
ctgtgttaaa gaaagaccat taaatgcttt agacagtgta aaaaaaaaaa aaaaaaaaa 1619 

<210> 212 
<211> 1010 
<212> DNA 

<213> Homo sapiens 

<400> 212 

ccgcagccgg gagcccgagc gcgggcgatg caggctccgc gagcggcacc tgcggctcct 
ctaagctacg accgtcgtct ccgctggcag cagctgcggg ccccagcagc ctcggcagcc 
acagccgctg cagcctgggg cagcctccgc tgctgtcgcc tcctctgatg cgcttgccct 
ctccctggcc ccgggactcc gggagaatgt gggtcctagg catcgcggca actttttgcg 
gattgttctt gcttccaagg ctttgcgctg caaatccagt gctaccagtg tgaagaattc 
cagctgaaca acgactgctc ctcccccgag ttcattgtga attgcacggt gaacgttcaa 
gacatgtgtc agaaagaagt gatggagcaa agtgccggga tcatgtaccg' caagtcctgt 
gcatcatcag cggcctgtct catcgcctct gccgggtacc agtccttctg ctccccaggg 
aaactgaact cagtttgcat cagctgctgc aacacccctc tttgtaaccg ggccaaggcc 
caagaaaagg ggaagttctg cctcggccct caggccaggg ctccgaacca ccatcctgtc 
cctcaaatta agccctactt ctcggcacac tgctggaagc ttgaagggag aaggcaccca 
ctcctgcata gtccatccag gcctcgcccc acacacccca ctccctgaga gagcacgccc 
agggagacca aaaaccggga taggcaacgg acccccagac accacaaggg acccgaggac 
aaagacgcag acaactcgcg aaagccaccc acgaatacaa cggcccgaac acagatataa 
cgcacgagcc ccgaccgaca agagaagaag cagaagaaac acccacagac agaaacagac 
accagcaaca agcgaaaaca gcaaaacgac actagcgaga caccacctgc acacaacacc 
acagcccaac acagaggaca cgacaacaaa gagacagcac caacgacgaa 

<210> 213 
<211> 480 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1010 
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<212> DNA 

<213> Homo sapiens 



<400> 213 

gccaactccg 

cagccgggag 

agctacgacc 

cgctgcagcc 

gccccgggac 

cttgcttcca 

caacgactgc 

tgagaaagaa 



gaggctctgg 
cccgagcgcg 
gtcgtctccg 
ggggcagcct 
tccgggagaa 
ggctttgcgc 
tcctcccccg 
gtgatggagc 



tgctcggccc 
ggcgatgcag 
cggcagcagc 
ccgctgctgt 
tgtgggtcct 
tgcaaatcca 
agttcattgt 
aaagtgccgg 



gggagcgcga 
gctccgcgag 
gcgggcccca 

cgcctcctct 
aggcatcgcg 
gtgctaccag 
gaattgcacg 
gatcatgtac 



gcgggaggag 
cggcacctgc 
gcagcctcgg 

gatgcgcttg 
gcaacttttt 
tgtgaagaat 
gtgaacgttc 
cgcaagtcct 



cagagacccg 60 
ggctcctcta 120 
cagccacagc 180 
ccctctcccg 240 
gcggattgtt 300 
tccagctgaa 360 
aagacatgtg 420 
gtgcatgatc 480 



<210> 214 

<211> 1897 

<212> DNA 

<213> Homo sapiens 



<400> 214 

gccaactccg 

cagccgggag 

agctacgacc 

cgctgcagcc 

gccccgggac 

cttgcttcca 

caacgactgc 

tcagaaagaa 

agcggcctgt 

ctcagtttgc 

gggaagttct 

agccctcttc 

ttccagccct 

attctgggta 

ctcggtaaag 

aggaggtgga 

agagtcwgtc 

cattccatcc 

ggctggaggg 

gaaaaccatt 

cgagactcaa 

aaatcttctc 

gcacaaagcg 

tggcgtgctg 

gcgaggacac 

agaaggggag 

tggccgcagc 

caatcagccc 

gagaacacag 

gccaggaaat 

gagagaattt 

aatgctttag 



gaggctctgg 
cccgagcgcg 
gtcgtctccg 
ggggcagcct 
tccgggagaa 
ggctttgcgc 
tcctcccccg 
gtgatggagc 
ctcatcgcct 
atcagctgct 
gcctcggccc 
tcggcacact 
cgcccccaac 
gggagcggga 
cattctgaat 
gtgaaagttc 
cttagaagtc 
ctgatggggg 
ccagttccca 
ctctgcagca 
tgactgggac 
agttctcctt 
ggcgctgtcg 
cagcgacagg 
cgtgtacagg 
gaggatcatg 
catgatcctc 
tgggcagaca 
taaatgaata 
ggtaccaatt 
aacactgttt 
acagtgtaaa 



tgctcggccc 
ggcgatgcag 
cggcagcagc 
ccgctgctgt 
tgtgggtcct 
tgcaaatcca 
agttcattgt 
aaagtgccgg 
ctgccgggta 
gcaacacccc 
tcaggccagg 
gctgaagctg 
cccccacctc 
gtccgtgttc 
aaattcagcy 
acccccatgt 
actgaggtgg 
catagtttga 
ctcaaggctc 
gaattggctg 
ttagactggg 
gcagaggact 
gtggtggagt 
cggcagcaca 
agcgggttga 
tacgcccgga 
cgaatctggt 
cgagcaggag 
aaaccataaa 
tttcagtgtt 
caaacccggg 
aaaaaaaaaa 



gggagcgcga 
gctccgcgag 
gcgggcccca 
cgcctcctct 
aggcatcgcg 
gtgctaccag 
gaattgcacg 
gatcatgtac 
ccagtccttc 
tctttgtaac 
gctccgcacc 
aaggagatgc 
cctgagtgag 
tcttttgttc 
tgactgaatt 
ctgtgtaacc 
gcatctgcct 
gactgcagag 
cctcgcttga 
gtttcgcgcc 
gctcggcctc 
ggcgccggga 
gcgcatgtac 
gcacctgcac 
tgaccgagct 
agtaggacct 
tgggcatcca 
ggagagacag 
atatttagcc 
ggacttgaca 
ggagttggct 
aaaaaaa 



gcgggaggag 
cggcacctgc 
gcagcctcgg 
gatgcgcttg 
gcaacttttt 
tgtgaagaat 
gtgaacgttc 
cgcaagtcct 
tgctccccag 
gggccaaggc 
accatcctgt 
caccccctcc 
tttcttctgg 
ctgtgcaaat 
ttcagtatgt 
ggagtcaagg 
tttgtaaagc 
tgagagtgac 
cattcaaact 
tgagttgggc 
gctctgaaaa 
cgcgaagagc 
gcgcaggcgc 
gaacacccgc 
gaggtagaaa 
cgtccagtcg 
gcatacggcc 
agaaaagaaa 
cctctgttct 
gcttcttttg 
gtgttaaaga 



cagagacccg 
ggctcctcta 
cagccacagc 
ccctctcccg 
gcggattgtt 
tccagctgaa 
aagacatgtg 
gtgcatcatc 
ggaaactgaa 
ccaagaaaag 
tcctcaaatt 
tgcattgttc 
gtgtcctttt 
aatgaaagag 
acttgaagga 
ccaggctggc 
ctccagtgtc 
gttttcttag 
tcatgctcct 
tctagtgact 
gtgcttaaga 
aacgggcgct 
ttctcgtggt 
cgaaactgct 
aacgtctccg 
tgcttgggtt 
aatgtcacaa 
aacacagcat 
gtgcttactg 
ccacaagcaa 
aagaccatta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1897 



<210> 215 
<211> 141 
<212> PRT 

<213> Homo sapiens 
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<400> 215 

Met Trp Val Leu Gly lie Ala Ala Thr Phe Cys Gly Leu Phe Leu Leu 

1 5 10 15 

Pro Gly Phe Ala Leu Gin He Gin Cys Tyr Gin Cys Glu Glu Phe Gin 

20 25 30 

Leu Asn Asn Asp Cys Ser Ser Pro Glu .Phe He Val Asn Cys Thr Val 

35 . 40 . 45 

Asn Val Gin Asp Met Cys Gin Lys Glu Val Met Glu Gin Ser Ala Gly 

50 55 60 

He Met Tyr Arg Lys Ser Cys Ala Ser Ser Ala Ala Cys Leu He Ala 
65 70 75 80 

Ser Ala Gly Tyr Gin Ser Phe Cys Ser Pro Gly Lys Leu Asn Ser Val 

85 90 95 

Cys He Ser Cys Cys hsn Thr Pro Leu Cys Asn Gly Pro Arg Pro Lys 

100 105 110 

Lys Arg Gly Ser Ser Ala Ser Ala Leu Arg Pro Gly Leu Arg Thr Thr 

115 120 125 

He Leu Phe Leu Lys Leu Ala Leu Phe Ser. Ala His Cys 
130 135 140 



